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Title of the Invention 

Avian Transgenesis using a chicken Ovalbumin Gene Region 

5 The present application claims priority from U.S. provisional patent 

applications, Serial Nos. 60/462,953, filed April. 15, 2003; 60/465,215, filed April 24, 
2003; and 60/469,488 filed May 9, 2003, all of which are hereby incorporated by 
reference herein in their entireties. 

10 Field of the Invention 

The present invention relates generally to an isolated nucleic acid molecule 
comprising an avian ovalbumin transcriptional regulatory control region and linked 
matrix attachment regions. The invention further relates to recombinant nucleic acids 
and expression vectors* genetically transformed cells and transgenic avians that 
15 comprise an avian ovalbumin transcriptional regulatory region operably linked to a 
heterologous polypcptide-encoding nucleic acid insert. The present invention also 
relates to the expression and production of die polypeptide-encoding nucleic acid 
molecule under the control of the isolated avian ovalbumin transcriptional regulatory 
region, 

20 Background 

Transgenic technology to convert animals into "bioreactor" for the production 
of specific proteins or other substances of pharmaceutical interest CGordon et aL , 
1987, Biotechnology 5: 1183-1 187; Wilrnut etai . 1990, Theriogenology 33: 1 13-123) 
offers significant advantages over more conventional methods of protein production 

25 by gene expression. 

Recombinant nucleic acid molecules have been engineered so that an 
expressed heterologous protein may be joined to a protein or peptide that allows 
secretion of the transgenic expression product into milk or urine, from which the 
protein may then be recovered. These procedures may require lactating animals, with 

30 the attendant costs of maintaining individual animals or herds of large species, such as 
cows, sheep, or goats. 



RECTIFIED SHEET (RULE 91) 



WO 2004/094640 



PCT/US2003/039244 



Historically, transgenic animals have been produced almost exclusively by 
microinjection of the fertilized egg. The pronuclei of fertilized eggs are 
microinjected in vitro with foreign, i.e., xenogeneic or allogeneic, heterologous DNA 
or hybrid DNA molecules. The microinjected fertilized eggs are then transferred to 
5 the genital tract of a pseudopregnant female (e.g., Krimpenfort et aL , U.S. Pat No. 
5,175,384). 

One system that holds potential is the avian reproductive system. The 
production of an avian egg begins with formation of a large yolk in the ovary of the 
hen. The unfertilized oocyte or ovum is positioned on top of the yolk sac. After 

1 0 ovulation, the ovum passes into the infundibulum of the oviduct where it is fertilized 
if sperm are present, and then moves into the magnum of the oviduct, which is lined 
with tubular gland cells. These cells secrete the egg-white proteins, including 
ovalbumin, lysozyme, ovomucoid, conalbumin and ovomucin, into the lumen of the 
magnum where they are deposited onto the avian embryo and yolk. 

1 5 The hen oviduct offers outstanding potential as a protein bioreactor because of 

the high levels of protein production, the promise of proper folding and post- 
translation modification of the target protein, the ease of product recovery, and the 
shorter developmental period of chickens compared to other potential animal species. 
The chicken ovalbumin gene is highly expressed in the tubular glands of the mature 

20 hen oviduct and is therefore a suitable candidate for an efficient promoter for 
heterologous protein production in transgenic birds. Efforts have been made to create 
transgenic chickens expressing heterologous proteins in the oviduct by means of 
microinjection of DNA (PCT Publication WO 97/47739). 

Gene expression must be considered not only from the perspective of cis- 

25 regulatory elements associated with a gene, and their interactions with trans-acting 
elements, but also with regard to the genetic environment in which they are located. 
Chromosomal positioning effects result in variations in levels of transgene expression 
associated with different locations of the transgene within the recipient genome. An 
important factor governing the level of transgene expression is the chromatin 

30 structure around a transgene, and how it cooperates with the cis-regulatory elements. 
While the deletion of a cis-regulatory element from a transgenic lysozyme locus can 
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be sufficient to reduce or eliminate positional independence of the level of gene 
expression , there is also evidence that positional independence conferred on a 
transgene requires the cotransfer of many kilobases of DNA other than just the 
protein encoding region and the immediate cis-transcriptional regulatory elements. 
5 Scattered throughout the chicken genome, including the chicken ovalbumin 

locus, are short sequences that resemble features of Long Terminal Repeats (LTRs) of 
retrovirus. The function of these elements is unclear but most likely may help define 
the DNAse hypersensitive (DHS) regions of a gene locus ( Stein et al > 1983, Proc. 
Natl Acad. Set U.S.A. 80: 6485-6489). Thus, flanking various avian genes are matrix 

10 attachment regions (5' and 3' MARs), alternatively referred to as "scaffold attachment 
regions" or SARs. The outer boundaries of the chicken lysozyme locus, for example, 
have been defined by the MARs ( Phi- Van et al ., 1988, E.M.B.OJ. 7: 655-664; Phiz 
Van & Stratling ., 1996, Biochem. 35: 10735-10742). Deletion of a 1.32 kb or a 1.45 
kb region, each comprising half of a 5' MAR, reduces positional variation in the level 

15 of transgene expression ( Phi-Van & Stratling , supra). 

The 5' matrix attachment region (5' MAR), located about -1 1 .7 kb upstream of 
the chicken lysozyme transcription start site, can increase the level of gene expression 
by limiting the chromosomal positional effects exerted against a transgene ( Phi-Van 
etal., 1988, supra). At least one other MAR is located 3 ! downstream of the protein 

20 encoding region. Although MAR nucleic acid sequences are conserved, little cross- 
hybridization is seen, indicating significant overall sequence variation. However, 
MARs of different species can interact with the nucleomatrices of heterologous 
species, to the extent, for example, that the chicken lysozyme MAR can associate 
with the plant tobacco nucleomatrix as well as that of the chicken oviduct cells 

25 ( Mlvnarona et al < 1994, Cell 6: 417-426; von Kries et gL 9 1990, Nucleic Acids Res. 
18: 3881-3885). The lysozyme promoter region of chicken is also active when 
transfected into mouse fibroblast cells and linked to a reporter gene such as the 
bacterial chloramphenicol acetyltransferase gene. In each case, the presence of a 5* 
MAR element increased positional independency of the level of transcription ( Stief et 

30 al, 1989, Nature 341: 343-345; Siunel et al , pgs. 257 - 265 in Houdeline L.M. (ed), 
"Transgenic Animals: Generation and Use"). 
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The ability to direct the insertion of a transgene into a site in the genome of an 
animal where the positional effect is limited offers predictability of results during the 
development of a desired transgenic animal, and increased yields of the expressed 
product. Sip pel and Steif disclose, in U.S. Patent No. 5,731,178, methods to increase 
5 the expression of genes introduced into eukaryotic cells by flanking a transcription 
unit with scaffold attachment elements, in particular the 5' MAR isolated from the 
chicken lysozyme gene. The transcription unit disclosed by Sip pel and Steif was an 
artificial construct that combined only the -6.1 kb enhancer element and the proximal 
promoter element (base position -579 to +15) from the lysozyme gene. Other 
10 promoter associated elements were not included. 

Although individual cis-transcriptional regulatory elements associated with 
the chicken ovalbumin gene have been isolated and sequenced, together with short 
regions of flanking DNA, the entire nucleic acid sequence comprising the 5 f upstream 
region of the ovalbumin gene has not been determined and has not been employed as 
15 a functional promoter to allow expression of a heterologous transgene. 

What are still needed, however, are efficient transcription promoters that 
allow expression of transgenes in avian cells but with reduced positional variation. 

What is also still needed is a gene expression promoter cassette that will allow 
expression of a transgene in the oviduct cells of an avian and efficient gene 
20 expression regardless of the chromosomal location of the expression system. 

Summary of the Invention 

Briefly described, the present invention relates to novel isolated and 
recombinant nucleic acid molecules that comprise an avian ovalbumin transcriptional 
25 regulatory region and at least one matrix attachment region element. 

The isolated and recombinant nucleic acid molecules of the present invention, 
because of the presence of at least one matrix attachment region, are useful for 
reducing chromosomal positional effects on a transgene operably linked to the 
ovalbumin transcriptional regulatory region and transfected into a recipient cell. 
30 Isolating an approximately 195 kb region of the chicken genome that includes regions 
upstream of the ovalbumin locus ensures that cis-elements are also included that will 
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allow gene expression in a tissue-specific manner. The ovalbumin promoter region of 
the present invention, therefore, will allow expression of an operably linked 
heterologous nucleic acid insert by a transfected avian cell such as, for example, a 
somatic cell. 

5 The present invention provides a novel isolated nucleic acid molecule of 

approximately 195 kb of the chicken genome, and truncated variants thereof, 
comprising a region of about 135 kb that is 5' upstream, and an approximately 45 kb 
region that is 3' downstream, of the ovalbumin-encoding region of the gene locus. 
The novel isolated chicken nucleic acid sequence includes matrix attachment regions 

10 both 5' and 3' of the ovalbumin gene and an ovalbumin transcriptional regulatory 
region that includes CR1 repeat elements, a proximal ovalbumin promoter. 
Interspersed among the elements are stretches of nucleic acid that serve at least to 
organize the elements in an ordered array. The novel isolated chicken genomic 
region also includes the ovalbumin-encoding region with a plurality of introns 

1 5 dispersed therein. 

The present invention further provides recombinant nucleic acid molecules for 
operably linking an avian ovalbumin transcriptional regulatory region to a 
heterologous nucleic acid molecule insert encoding a polypeptide to be expressed by 
a transfected or transgenic cell. The heterologous nucleic acid molecule may be 

20 placed in frame with a signal peptide sequence. Translation initiation may start with 
the signal peptide and continue through the nucleic acid molecule to produce an 
expressed polypeptide having the desired amino acid sequence. 

The sequence of the expressed heterologous nucleic acid insert may be 
optimized for codon usage by a host cell using approaches well known in the art. For 

25 example, codon usage may be optimized for an avian such as a chicken. This could 
be determined from the codon usage of at least one, and preferably more than one, 
protein expressed in a chicken cell. For example, the codon usage may be determined 
from the nucleic acid sequences encoding the proteins ovalbumin, lysozyme, 
ovomucin and ovotransferrin of chicken. 

30 The recombinant nucleic acid molecules of the present invention may further 

comprise a polyadenylation signal sequence that allows transcription directed by an 
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ovalbumin transcriptional regulatory region to extend beyond the heterologous 
nucleic acid encoding a desired heterologous polypeptide and to comprise a 3 ! 
untranslated region and a polyadenylated tail. Any suitable functional 
polyadenylation signal sequence may be linked to the 3' end of the heterologous 
5 nucleic acid insert, including the SV40 polyadenylation signal sequence, bovine 
growth hormone adenylation sequence or the like. 

The recombinant nucleic acid molecules of the present invention may also 
comprise a chicken ovalbumin 3' domain. The 3* domain can include a 3' 
untranslated region of the ovalbumin gene, a polyadenylation signal and at least one 

10 MAR that, in combined action with an MAR upstream of the ovalbumin 
transcriptional regulatory region, may reduce positional variation in gene expression 
in transgenic avians. 

Yet another aspect of the present invention is expression vectors suitable for 
delivery to a recipient cell, preferably an avian cell. The expression vectors provided 

15 by the present invention may comprise an avian ovalbumin transcriptional regulatory 
region that can be operably linked to a nucleic acid insert encoding a polypeptide, and 
optionally a polyadenylation signal sequence. The expression vectors of the present 
invention further comprise at least one MAR element, and preferably two MARs that 
flank the ovalbumin transcriptional regulatory region and which can non-randomly 

20 direct the insertion of the expression vector into the genome of a recipient eukaryotic 
cell. The expression vector may further comprise a bacterial plasmid sequence, a 
viral nucleic acid sequence, or fragments or variants thereof that may allow for 
replication of the vector in a suitable host. 

Another aspect of the present invention is methods of expressing a 

25 heterologous polypeptide in a eukaryotic cell by transfecting the cell with a 
recombinant nucleic molecule comprising an avian ovalbumin transcriptional 
regulatory region operably linked to a nucleic acid insert encoding a polypeptide 
desired to be expressed and, optionally, a polyadenylation signal sequence, and 
culturing the transfected cell under conditions suitable for expression of the 

30 heterologous polypeptide under the control of the avian ovalbumin transcriptional 
regulatory region. 
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Also within the scope of the present invention are recombinant cells, tissues 
and animals containing non-naturally occurring recombinant nucleic acid molecules 
according to the present invention as described above. In one embodiment of the 
present invention, the transformed cell is a chicken oviduct cell and the nucleic acid 
5 insert comprises the chicken ovalbumin transcriptional regulatory region, a nucleic 
acid insert encoding a human interferon a2b that is codon optimized for expression in 
an avian cell, and an SV40 polyadenylation sequence. In another embodiment of the 
present invention, the nucleic acid insert encodes the heavy and light chains of an 
antibody. 

10 Additional objects and aspects of the present invention will become more 

apparent upon review of the detailed description set forth below when taken in 
conjunction with the accompanying figures, which are briefly described as follows. 

Brief Description of the Figures 

15 Fig. 1 illustrates the nucleic acid sequence SEQ ID NO: 1 of a region of the 

chicken genome that includes a chicken ovalbumin transcriptional regulatory region 
and the chicken ovalbumin gene, and matrix attachment regions 5' upstream and 3' 
downstream thereof. 

Fig. 2 schematically illustrates the chicken genomic region having nucleic 
20 acid sequence SEQ ID NO: 1, indicating the relative positions and orientations of 
regions having identity with known domains. 

Fig. 3 illustrates schematically the construction of an expression bacterial 
artificial chromosome where the insert gene of interest is under the expression control 
of the chicken ovalbumin promoter. Genes of interest may be inserted into the native 
25 translation start site of the ovalbumin gene. L and roman numerals, ovalbumin exons; 
GOI, gene of interest; start, translation start site; stop, translation stop site; pA, 
polyadenylation signal; E, EcoRl site. 

Fig. 4 illustrates an SV40 polyadenylation signal sequence SEQ ID NO: 2. 
Fig. 5 illustrates the nucleotide sequence SEQ ID NO: 3 of a human interferon 
30 a2b interferon optimized for expression in an avian cell. 
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Fig. 6 illustrates the reconstruction of the chicken genomic region containing 
the ovalbumin locus. 

Detailed Description of the Preferred Embodiments 

5 This description uses gene nomenclature accepted by the Cucurbit Genetics 

Cooperative as it appears in the Cucurbit Genetics Cooperative Report 18:85 (1995), 
which are incorporated herein by reference in its entirety. Using this gene 
nomenclature, genes are symbolized by italicized Roman letters. If a mutant gene is 
recessive to the normal type, then the symbol and name of the mutant gene appear in 
10 italicized lower case letters. 

For convenience, definitions of certain terms employed in the specification, 
examples, and appended claims are collected here. 
Definitions 

The term "avian" as used herein refers to any species, subspecies or race of 
15 organism of the taxonomic class ava, such as, but not limited to chicken, turkey, 
duck, goose, quail, pheasants, parrots, finches, hawks, crows and ratites including 
ostrich, emu and cassowary. The term includes the various known strains of Gallus 
gallus, or chickens, (for example, White Leghorn, Brown Leghorn, Barred-Rock, 
Sussex, New Hampshire, Rhode Island, Ausstralorp, Minorca, Amrox, California 
20 Gray, Italian Partidge-colored), as well as strains of turkeys, pheasants, quails, duck, 
ostriches and other poultry commonly bred in commercial quantities. It also includes 
an individual avian organism in all stages of development, including embryonic and 
fetal stages. The term "avian" also may denote "pertaining to a bird", such as "an 
avian (bird) cell." 

25 The term "nucleic acid" as used herein refers to any natural or synthetic linear 

and sequential array of nucleotides and nucleosides, for example cDNA, genomic 
DNA, mRNA, tRNA, oligonucleotides, oligonucleosides and derivatives thereof. For 
ease of discussion, such nucleic acids may be collectively referred to herein as 
"constructs," "plasmids," or "vectors." The term "nucleic acid" further includes 

30 modified or derivatized nucleotides and nucleosides such as, but not limited to, 
halogenated nucleotides such as, but not only, 5-bromouracil, and derivatised 
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nucleotides such as biotin-labeled nucleotides. 

The term "isolated nucleic acid molecule" as used herein refers to a nucleic 
acid molecule with a structure not identical to a naturally occurring nucleic acid 
molecule and includes DNA, RNA, or derivatives or variants thereof. The term 
5 covers, but is not limited to, (a) a DNA which has the sequence of part of a naturally 
occurring genomic molecule but is not flanked by at least one of the coding sequences 
that flank that part of the molecule in the genome of the species in which it naturally 
occurs; (b) a nucleic acid incorporated into a vector or into the genomic nucleic acid 
of a prokaryote or eukaryote in a manner such that the resulting molecule is not 

1 0 identical to any vector or naturally occurring genomic DNA; (c) a separate molecule 
such as a cDNA, a genomic fragment, a fragment produced by polymerase chain 
reaction (PCR), ligase chain reaction (LCR) or chemical synthesis, or a restriction 
fragment; (d) a recombinant nucleotide sequence that is part of a hybrid gene, i.e., a 
gene encoding a fusion protein, and (e) a recombinant nucleotide sequence that is part 

15 of a hybrid sequence that is not naturally occurring. Isolated nucleic acid molecules 
of the present invention can include, for example, natural allelic variants as well as 
nucleic acid molecules modified by nucleotide deletions, insertions, inversions, or 
substitutions such that the resulting nucleic acid molecule still essentially encodes an 
ovalbumin transcriptional regulatory region or a variant thereof of the present 

20 invention. 

The terms "polynucleotide," "oligonucleotide," and "nucleic acid sequence" 
are used interchangeably herein and include, but are not limited to, coding sequences 
(polynucleotide(s) or nucleic acid sequence(s) which are transcribed and translated 
into polypeptide in vitro or in vivo when placed under the control of appropriate 

25 regulatory or control sequences); control sequences (e.g., translational start and stop 
codons, promoter sequences, ribosome binding sites, polyadenylation signals, 
transcription factor binding sites, transcription termination sequences, upstream and 
downstream regulatory domains, enhancers, silencers, and the like); and regulatory 
sequences (DNA sequences to which a transcription factor(s) binds and alters the 

30 activity of a gene's promoter either positively (induction) or negatively (repression)). 
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No limitation as to length or to synthetic origin are suggested by the terms described 
above. 

As used herein the terms "peptide," "polypeptide" and "protein" refer to a 
polymer of amino acids in a serial array, linked through peptide bonds. A "peptide" 
5 typically is a polymer of at least two to about 30 amino acids linked in a serial array 
by peptide bonds. The term "polypeptide" includes proteins, protein fragments, 
protein analogues, oligopeptides and the like. The term "polypeptides" contemplates 
polypeptides as defined above that are encoded by nucleic acids, produced through 
recombinant technology (isolated from an appropriate source such as a bird), or 

10 synthesized. The term "polypeptides" further contemplates polypeptides as defined 
above that include chemically modified amino acids or amino acids covalently or 
noncovalently linked to labeling moieties. 

The term "fragment" as used herein refers to any isolated portion of the 
subject nucleic acid molecule constructed artificially (e.g., by chemical synthesis) or 

15 by cleaving a natural product into multiple pieces, using restriction endonucleases or 
mechanical shearing, or a portion of a nucleic acid synthesized by DNA polymerase, 
including by PGR, or any other polymerizing technique well known in the art, or 
expressed in a host cell by recombinant nucleic acid technology well known to one of 
skill in the art. The term "fragment" as used herein may also refer to an isolated 

20 portion of a polypeptide, wherein the portion of the polypeptide is cleaved from a 
naturally occurring polypeptide by proteolytic cleavage by at least one protease, or is 
a portion of the naturally occurring polypeptide synthesized by chemical or 
recombinant methods well known to one of skill in the art. 

The terms "recombinant nucleic acid" and "recombinant DNA" as used herein 

25 refer to combinations of at least two nucleic acid sequences that are not naturally 
found in a eukaryotic or prokaryotic cell. The nucleic acid sequences may include, 
but are not limited to, nucleic acid vectors, gene expression regulatory elements, 
origins of replication, suitable gene sequences that when expressed confer antibiotic 
resistance, protein-encoding sequences and the like. The term "recombinant 

30 polypeptide" is meant to include a polypeptide produced by recombinant DNA 
techniques. A recombinant polypeptide may be distinct from a naturally occurring 
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polypeptide either in its location, purity or structure. Generally, a recombinant 
polypeptide will be present in a cell in an amount different from that normally 
observed in nature. 

The term "gene" or "genes" as used herein refers to nucleic acid sequences 
5 that encode genetic information for the synthesis of a whole RNA, a whole protein, or 
any portion of such whole RNA or whole protein. Genes that are not naturally part of 
a particular organism's genome are referred to as "foreign genes," "heterologous 
genes" or "exogenous genes" and genes that are naturally a part of a particular 
organism's genome are referred to as "endogenous genes". The term "gene product" 

10 refers to an RNA or protein that is encoded by the gene. "Endogenous gene products" 
are RNAs or proteins encoded by endogenous genes. "Heterologous gene products" 
are RNAs or proteins encoded by "foreign, heterologous or exogenous genes" and 
are, therefore, not naturally expressed in the cell. 

The term "expressed" or "expression" as used herein refers to the transcription 

1 5 from a gene to give an RNA nucleic acid molecule at least complementary in part to a 
region of one of the two nucleic acid strands of the gene. The term "expressed" or 
"expression" as used herein may also refer to the translation from an RNA molecule 
to give a protein, a polypeptide or a portion thereof. 

As used herein, the term "locus" refers to the site of a gene on a chromosome. 

20 In diploid organisms, pairs of genes control hereditary traits, each in the same 
position on a pair of chromosomes. These gene pairs, or alleles, may both be 
dominant or both be recessive in expression of that trait. In either case, the individual 
is said to be homozygous for the trait controlled by that gene pair. If the gene pair 
(alleles) consists of one dominant and one recessive trait, the individual is 

25 heterozygous for the trait controlled by the gene pair. 

The term "operably linked" refers to an arrangement of elements wherein the 
components so described are configured so as to perform their usual function. 
Control sequences operably linked to a coding sequence are capable of effecting the 
expression of the coding sequence. The control sequences need not be contiguous 

30 with the coding sequence, so long as they function to direct the expression thereof. 
For example, intervening untranslated yet transcribed sequences can be present 
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between a promoter sequence and the coding sequence and the promoter sequence 
can still be considered "operably linked" to the coding sequence. 

The term "transcription regulatory sequences" as used herein refers to 
nucleotide sequences that are associated with a gene nucleic acid sequence and which 
5 regulate the transcriptional expression of the gene. Exemplary transcription 
regulatory sequences include enhancer elements, hormone response elements, steroid 
response elements, negative regulatory elements, and the like. 

The term "promoter" as used herein refers to the DNA sequence that 
determines the site of transcription initiation by an RNA polymerase. A "promoter- 

10 proximal element" is a regulatory sequence generally within about 200 base pairs of 
the transcription start site. 

The term "matrix attachment region" as used herein refers to a region of a 
eukaryotic genomic DNA that can be bound to chromosomal scaffold proteins. 
Matrix (scaffold) attachment regions (MARs) are generally located between 

15 transcription units such that the transcription units are within chromosomal loops. 
The bases of the loops are connected to the scaffold proteins through the MAR at 
each base. MARs and MAR-like homologs are identified as several recognizable 
nucleic acid sequences including, but not limited to, TG-rich spans, AT-rich regions 
and consensus sequences as described by Wang et ah J. Biol Che?n. 270:23239- 

20 23242 (1995). MARs may be identified by using suitable software such as, for 
example, MAR-WIZ™ (Futuresoft, Michigan, USA) 

The term "internal ribosome entry sites (IRES)" as used herein refers to a 
region of a nucleic acid, most typically an RNA molecule, wherein eukaryotic 
initiation of protein synthesis occurs far downstream of the 5' end of the RNA 

25 molecule. A 43S pre-initiation complex comprising the elf2 protein bound to GTP 
and Met-tRNAj Met , the 40S ribosomal subunit, and faction elf3 and 31fl A may bind to 
an "IRES" before locating an AUG start codon. An "IRES" may be used to initiate 
translation of a second coding region downstream of a first coding region, wherein 
each coding region is expressed individually, but under the initial control of a single 

30 upstream promoter. An "IRES" may be located in a eukaryotic cellular mRNA. 

The term "coding region" as used herein refers to a continuous linear 
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arrangement of nucleotides which may be translated into a polypeptide. A full length 
coding region is translated into a full length protein; that is, a complete protein as 
would be translated in its natural state absent any post-translational modifications. A 
full length coding region may also include any leader protein sequence or any other 
5 region of the protein that may be excised naturally from the translated protein. 

The terms "complementary", "complementarity" or "complement" as used 
herein refers to two nucleic acid molecules that can form specific interactions with 
one another to form a base-paired double helix. 

The term "probe" as used herein, when referring to a nucleic acid, refers to a 

10 nucleotide sequence that can be used to anneal or hybridize with and thereby identify 
the presence of a complementary sequence, or a complementary sequence differing 
from the probe sequence but not to a degree that prevents hybridization under the 
hybridization stringency conditions used. The probe may be modified with labels 
such as, but not only, radioactive groups, biotin, and the like that are well known in 

1 5 the art. 

The term "hybridizing under stringent conditions" as used herein refers to 
annealing a first nucleic acid to a second nucleic acid under stringent conditions as 
defined below. Stringent hybridization conditions typically permit the hybridization 
of nucleic acid molecules having at least 70% nucleic acid sequence complementarity 

20 with the nucleic acid molecule being used as a probe in the hybridization reaction, 
e.g., high temperature and/or low salt content that tend to disfavor hybridization of 
dissimilar nucleotide sequences. Alternatively, hybridization of the first and second 
nucleic acid may be conducted under reduced stringency conditions, e.g., low 
temperature and/or high salt content that tend to favor hybridization of dissimilar 

25 nucleotide sequences. Low stringency hybridization conditions may be followed by 
high stringency conditions or intermediate medium stringency conditions to increase 
the selectivity of the binding of the first and second nucleic acids. The hybridization 
conditions may further include reagents such as, but not limited to, dimethyl sulfoxide 
(DMSO) or formamide to disfavor still further the hybridization of dissimilar 

30 nucleotide sequences. A suitable hybridization protocol may, for example, involve 
hybridization in 6X SSC (wherein IX SSC comprises 0.015 M sodium citrate and 
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0.15 M sodium chloride), at 65° Celsius in an aqueous solution, followed by washing 
with IX SSC at 65° Celsius. Formulae to calculate appropriate hybridization and 
wash conditions to achieve hybridization permitting 30% or less mismatch between 
two nucleic acid molecules are disclosed, for example, in Meinkoth et al * 1984, Anal 
5 Biochem. 138: 267-284; the content of which is incorporated herein by reference in its 
entirety. Protocols for hybridization techniques are well known to those of skill in the 
art and standard molecular biology manuals may be consulted to select a suitable 
hybridization protocol without undue experimentation. See, for example, Sambrook 
et aL, 1989, "Molecular Cloning: A Laboratory Manual", 2nd ed., Cold Spring 
10 Harbor Press, the contents of which are herein incorporated by reference in its 
entirety. 

Typically, stringent conditions will be those in which the salt concentration is 
less than about 1.5 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or 
other salts) from about pH 7.0 to about pH 8.3 and the temperature is at least about 

15 30° Celsius for short probes (e.g., 10 to 50 nucleotides) and at least about 60° Celcius 
for long probes (e.g., greater than 50 nucleotides). Stringent conditions may also be 
achieved with the addition of destabilizing agents such as formamide. Exemplary low 
stringency conditions include hybridization with a buffer solution of 30 to 35% 
formamide, 1 M NaCl, 1% SDS (sodium dodecyl sulphate) at 37° Celsius, and a wash 

20 in lx to 2x SSC at 50 to 55° Celsius. Exemplary moderate stringency conditions 
include hybridization in 40 to 45% formamide, 1 M NaCl, 1% SDS at 37° Celsius, 
and a wash in 0.5x to lx SSC at 55 to 60° Celsius. Exemplary high stringency 
conditions include hybridization in 50% formamide, 1 M NaCl, 1% SDS at 37° 
Celsius, and a wash in 0.1 x SSC at 60 to 65° Celsius. 

25 The terms "percent sequence identity" as used herein refers to the degree of 

sequence identity between two nucleic acid sequences or two amino acid sequences 
as determined using the algorithm of Karlin & Attschul, 1990, Proc. Natl. Acad. Sci, 
87: 2264-2268, modified as in Karlin & Attschuk 1993, Proc, Natl Acad. Sci. 90: 
5873-5877. Such an algorithm is incorporated into the NBLAST and XBLAST 

30 programs of Attschul et aL , 1990, J. Mol. Biol Q15: 403-410. BLAST nucleotide 
searches are performed with the NBLAST program, score = 100, wordlength = 12, to 
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obtain nucleotide sequences homologous to a nucleic acid molecule of the invention. 
BLAST protein searches are performed with the XBLAST program, score = 50, 
wordlength = 3, to obtain amino acid sequences homologous to a reference 
polypeptide. To obtain gapped alignments for comparison purposes, Gapped BLAST 
5 is utilized as described in Attschul et al . 1997, Nucl Acids Res. 25: 3389-3402. 
When utilizing BLAST and Gapped BLAST programs, the default parameters of the 
respective programs (e.g. XBLAST and NBLAST) are used. Other algorithms, 
programs and default settings may also be suitable such as, but not only, the GCG- 
Sequence Analysis Package of the U.K. Human Genome Mapping Project Resource 

10 Centre that includes programs for nucleotide or amino acid sequence comparisons. 

The terms "vector" or "nucleic acid vector" as used herein refer to a natural or 
synthetic single or double stranded plasmid or viral nucleic acid molecule (RNA or 
DNA) that can be transfected or transformed into cells and replicate independently of, 
or within, the host cell genome. The term "expression vector" as used herein refers to 

15 a nucleic acid vector that comprises a transcription regulatory region operably linked 
to a site wherein is, or can be, inserted, a nucleotide sequence to be transcribed and, 
optionally, to be expressed, for instance, but not limited to, a sequence coding at least 
one polypeptide. 

The term "transfection" as used herein refers to the process of inserting a 
20 nucleic acid into a host cell. Many techniques are well known to those skilled in the 
art to facilitate transfection of a nucleic acid into an eukaryotic cell. These methods 
include, for instance, treating the cells with high concentrations of salt such as a 
calcium or magnesium salt, an electric field, detergent, or liposome mediated 
transfection, to render the host cell competent for the uptake of the nucleic acid 
25 molecules, and by such methods as micro-injection into a pro-nucleus, sperm- 
mediated and restriction-mediated integration. 

The terms "recombinant cell" and "genetically transformed cell" refer to a cell 
comprising a combination of nucleic acid segments not found in a single cell with 
each other in nature. A new combination of nucleic acid segments can be introduced 
30 into an organism using a wide array of nucleic acid manipulation techniques available 
to those skilled in the art. A recombinant cell can be a prokaryotic cell, or a 
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eukaryotic cell, such as, but not limited to 5 an avian cell. The recombinant cell may 
harbor a vector that is extragenomic, i.e.that does not covalently insert into the 
cellular genome, including a non-nuclear (e.g. mitochondrial) genome(s). A 
recombinant cell may further harbor a vector or a portion thereof that is intragenomic, 
5 i.e. covalently incorporated within the genome (including non-nuclear genome(s)) of 
the recombinant cell. 

As used herein, a "transgenic avian" is any avian, as defined above, including 
the chicken, in which one or more of the cells of the avian contain heterologous 
nucleic acid introduced by manipulation, such as by transgenic techniques. The 

10 nucleic acid may be introduced into a cell, directly or indirectly, by introduction into 
a precursor of the cell by way of deliberate genetic manipulation, such as by 
microinjection or by infection with a recombinant virus. Genetic manipulation also 
includes classical cross-breeding, or in vitro fertilization. A recombinant DNA 
molecule may be integrated within a chromosome, or it may be extrachromosomally 

1 5 replicating DNA. 

The terms "chimeric animal" or "mosaic animal" are used herein to refer to 
animals in which the recombinant gene is found, or in which the recombinant is 
expressed, in some but not all cells of the animal. The term "tissue-specific chimeric 
animal" indicates that the recombinant gene is present and/or expressed in some 

20 tissues but not others. 

As used herein, the term "transgene" means a nucleic acid sequence that is 
partly or entirely heterologous, i.e., foreign, to the transgenic animal or cell into 
which it is introduced, or, is homologous to an endogenous gene of the transgenic 
animal or cell into which it is introduced, but which is designed to be inserted, or is 

25 inserted, into the animal's genome in such a way as to alter the genome of the cell 
into which it is inserted (e.g., it is inserted at a location which differs from that of the 
natural gene or its insertion results in a knockout). 

The term "chromosomal positional effect" as used herein refers to the 
variation in the degree of gene transcription as a function of the location of the 

30 transcribed locus within the cell genome. Random transgenesis may result in a 
transgene being inserted at different locations in the genome so that individual cells 
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of a population of transgenic cells may each have at least one transgene, each at a 
different location and therefore each in a different genetic environment. Each cell, 
therefore, may express the transgene at a level specific for that particular cell and 
dependent upon the immediate genetic environment of the transgene. In a transgenic 
5 animal, as a consequence, different tissues may exhibit different levels of transgene 
expression. The term "reduced chromosomal positioning effect" as used herein refers 
to a decreased intercellular variation in the level of gene transcription because of a 
reduction in the number of sites of insertion of a heterologous nucleic acid molecule 
into the genome of a recipient cell. Consequently, a reduced chromosomal 
10 positioning effect provides a more uniform population of genetically transformed 
cells with respect to transgene insertion sites in the cellular genomes. In transgenic 
animals, different tissues may exhibit reduced variability in the levels of transgene 
expression. 

The term "cytokine" as used herein refers to any secreted polypeptide that 

15 affects a function of cells and modulates an interaction between cells in the immune, 
inflammatory or hematopoietic response. A cytokine includes, but is not limited to, 
monokines and lymphokines. Examples of cytokines include, but are not limited to, 
interferon a2b, Interleukin-1 (IL-1), Interleukin-6 (IL-6), Interleukin-8 (IL-8), Tumor 
Necrosis Factor-a (TNF-a.) and Tumor Necrosis Factor p (TNF-p.). 

20 The term "antibody" as used herein refers to polyclonal and monoclonal 

antibodies and fragments thereof, and immunologic binding equivalents thereof. 
Antibodies may include, but are not limited to polyclonal antibodies, monoclonal 
antibodies (mAbs), humanized or chimeric antibodies, single chain antibodies, Fab 
fragments, F(ab') 2 fragments, fragments produced by a Fab expression library, anti- 

25 idiotypic (anti-Id) antibodies, and epitope-binding fragments of any of the above. 

The term "immunoglobulin polypeptide" as used herein refers to a constituent 
polypeptide of an antibody or a polypeptide derived therefrom. An "immunological 
polypeptide" may be, but is not limited to, an immunological heavy or light chain and 
may include a variable region, a diversity region, joining region and a constant region 

30 or any combination, variant or truncated form thereof The term "immunological 
polypeptides" further includes single-chain antibodies comprised of, but not limited 
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to, an immunoglobulin heavy chain variable region, an immunoglobulin light chain 
variable region and optionally a peptide linker. 

Techniques useful for isolating and characterizing the nucleic acids and 
proteins of the present invention are well known to those of skill in the art and 
5 standard molecular biology and biochemical manuals may be consulted to select 
suitable protocols without undue experimentation. See, for example, Sambrook et al , 
1989, "Molecular Cloning: A Laboratory Manual", 2nd ed., Cold Spring Harbor, the 
content of which is herein incorporated by reference in its entirety. 

10 Abbreviations 

Abbreviations used in the present specification include the following: aa, 
amino acid(s); bp, base pair(s); kb, kilobase; cDNA, DNA complementary to RNA; 
SSC, sodium chloride-sodium citrate; DMSO, dimethyl sulfoxide; MAR, matrix 
attachment region; CPE, chromosomal positioning effect; BAC, bacterial artificial 

1 5 chromosome; YAC, yeast artificial chromosome. 

The present invention provides novel isolated and recombinant nucleic acid 
molecules comprising an avian ovalbumin transcriptional regulatory region and at 
least one MAR element, which are useful as vectors for inserting a heterologous 

20 nucleic acid molecule into the genome of a recipient avian cell. The novel isolated 
nucleic acid molecules of the present invention are particularly useful for directing 
the incorporation of a heterologous nucleic acid that is under transcriptional 
regulation of an avian ovalbumin gene promoter, into the genome of a recipient avian 
cell while reducing or avoiding chromosomal positioning effects that would otherwise 

25 result from randomly distributed insertions of the heterologous nucleic acid molecule 
into the recipient avian genome. The present invention further provides methods of 
delivering a heterologous nucleic acid under the transcriptional regulation of an avian 
ovalbumin transcriptional regulatory region, to an avian cell, whereby the 
heterologous nucleic acid desired to be expressed under the associated avian 

30 ovalbumin gene transcriptional regulatory element can be integrated into an avian cell 
genome. As well as providing recombinant nucleic acids, vectors and derivatives 



- 18- 



WO 2004/094640 



PCT/US2003/039244 



thereof, the present invention provides transfected and transgenic avian cells and 
birds derived therefrom that are capable of producing a heterologous polypeptide in 
the serum or the white of a laid egg. 

Nucleic acids comprising the chicken ovalbumin gene and 5* and 3' MAR elements 
5 The novel isolated and recombinant nucleic acid molecules of the present 

invention comprise the chicken ovalbumin gene comprising transcriptional regulatory 
elements positioned 5 r upstream of the ovalbumin-encoding region of the native 
chicken ovalbumin locus and which are necessary for the regulated expression of a 
downstream polypeptide-encoding nucleic acid, and at least one MAR element. 

10 The inclusion of a MAR element, and preferably at least two MARs, in the 

same nucleic acid and flanking the ovalbumin gene region, may confer positional 
independence to a transfected gene operably linked to the ovalbumin transcriptional 
regulatory region. While not wishing to be bound by any one theory, it is believed 
that the 5 ! and 3' MARs of a transfected nucleic acid molecule of the present 

1 5 invention restrict the number of possible transgene insertion sites within the genome 
of the recipient avian cell, thereby reducing chromosomal positioning effects upon 
transcription levels. Thus the isolated novel nucleic acid molecules of the present 
invention are useful for reducing the chromosomal positional effects exerted on 
heterologous transgene expression. The heterologous transgene will be operably 

20 linked to the ovalbumin transcriptional regulatory region within a novel recombinant 
nucleic acid molecule transfected into a recipient avian cell. Included in the nucleic 
acid molecules of the present invention are a region of the avian genome 
encompassing a MAR upstream of the ovalbumin locus and cis-regulatory elements 
that may allow gene expression in a tissue-specific manner. The ovalbumin promoter 

25 region of the novel nucleic acid molecules is especially useful for directing 
expression of an operably linked heterologous nucleic acid in a transfected avian cell 
such as an avian oviduct cell. 

Also within the scope of the present invention that nucleic acid molecules 
further comprising a region of the chicken ovalbumin locus that is 3' of the 

30 ovalbumin-encoding region, or of a nucleic acid insert encoding a heterologous 
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present invention includes at least one nucleic acid sequence encoding a 3' MAR 
element which may cooperate with a 5' MAR to limit the number of sites of insertion 
into the genome of an avian cell of a transfected nucleic acid molecule. In either 
event, the directed insertion induced by one or more MARs can reduce or eliminate 
5 chromosomal positioning effects, resulting in a more uniform level of gene 
expression of the heterologous nucleic acid insert in a population of genetically 
transformed cells. 

(a) Isolated nucleic acid encompassing the chicken ovalbumin gene 

One aspect of the present invention, therefore, is a nucleic acid molecule 

10 isolated from the genome of a chicken and comprising a proximal ovalbumin 
promoter suitable for directing transcription of regulation of a transcript encoding 
ovalbumin, and 5 f and 3' MAR elements flanking the ovalbumin gene region. 

BACs 120 and 77 (ATCC Accession Nos. ) containing 

overlapping regions of the chicken genome, were sequenced and compiled as the 

15 contiguous sequence SEQ ID NO: 1. BAC 120 includes the sequence from 
nucleotide position 1 to position 157354 of SEQ ID NO: 1. The sequence of BAC 77 
begins at nucleotide position 157355 of SEQ ID NO: 1 to position 195102. The 
nucleic sequence of the 195,102 bp chicken genomic region SEQ ID NO: 1 (GenBank 
Accession No. ) is shown in Fig. 1. A schematic showing identifiable 

20 domains within SEQ ID NO: 1 that have sequence identity or homology to known 
domain families or previously identified genes mainly identified using BLAST, 
GenScan and MARWIZ software is shown in Fig. 2. BAC 26, constructed as 
described in Example 1 below and containing the entire nucleic acid insert SEQ ID 
NO: 1 less about 11.5 kb at the extreme 5' end, was deposited with American Type 

25 Culture Collection (ATCC), 10801 University Blvd., Manassas, VA 201 10, as ATCC 
No. PTA-5548 on September 24, 2003, under the conditions set forth in the Budapest 
Treaty. 

The nucleic acid molecule SEQ ID NO: 1 of the present invention has at least 
four MAR elements. One MAR element is 5' upstream of the ovalbumin gene, 
30 between about nucleotide positions 41701 and 41900. MAR-like elements are also 
between nucleotide positions 56001-56201, 56501-56901, 58401-58701, 76251- 
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between nucleotide positions 56001-56201, 56501-56901, 58401-58701, 76251- 
76451 and 80151-80451. Another MAR element is between about 96401-96800. 
MAR elements located 3' downstream of the ovalbumin gene are at nucleotide 
positions about 144651-144850, about 150601-151600, about 156681-157181, about 
5 157081-15781, about 163701-164100, about 186201-186590 and about 190101- 
190800 of SEQ ID NO: 1, The chicken ovalbumin gene ATG start codon is at 
nucleotide position 133372. 

Also dispersed along the nucleic acid molecule represented by SEQ ID NO: 1 
are other identifiable domains listed, for example, in Table 2 below, including several 

10 serpin- or serpin-like encoding genes, cis transcription regulatory elements of both the 
serpin-like and ovalbumin genes, and at least two other, putatively functional genes, 
X and Z. Between the various domains, genes or other elements are stretches of 
nucleotides, the functions of which may serve to maintain the position and 
configuration of the elements relative to each other. 

15 The isolated nucleic acids of the present invention and derivatives and 

truncated variants thereof may be incorporated into a vector, such as a bacterial or 
yeast artificial chromosome. The BAC cloning system ( Shizuya et aU Proc. Natl 
Acad. Sci (U.S.A.), 89:8794:8797, (1992) has been developed to stably maintain large 
fragments of genomic DNA (100-300 kb) in E. coli. An exemplary BAC vector 

20 consists of the pBeloBACl 1 vector that has been described by Kim et aL Genomics, 
34:213:218 (1996). Genomic DNA can be partially digested, for example, using 
enzymes that permit ligation into either the BamH I or Hind III sites in the vector. It 
is contemplated that any suitable restriction sites may be used that are useful for 
incorporating genomic DNA into a selected BAC vector. Flanking these cloning sites 

25 are T7 and SP6 RNA polymerase transcription initiation sites that can be used to 
generate end probes by either RNA transcription or PCR methods. BAC DNA is 
purified from the host cell as a supercoiled circle. Converting these circular 
molecules into a linear form precedes both size determination and introduction of the 
BACs into recipient avian cells. A suitable cloning site may be flanked, for example, 

30 by two Not I restriction sites, permitting cloned segments to be excised from the 
vector by Not I digestion. Alternatively, the BAC vector may be linearized by 
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treatment with the commercially available enzyme lambda terminase that leads to the 
cleavage at a cosN site. However, this cleavage method results in a full length BAC 
clone containing both the insert DNA and the BAC vector sequences. 

One embodiment of the novel isolated nucleic acid molecules of the present 
5 invention, therefore, is an isolated chicken nucleic acid molecule encoding an 
ovalbumin transcriptional regulatory region and a 5 1 MAR. In one embodiment of the 
present invention, the novel isolated nucleic acid molecule further comprises a 3 f 
MAR downstream of the ovalbumin gene. The isolated nucleic acid molecules of the 
present invention may also include nucleic acid elements such as, but not limited to, a 

10 transcription enhancer element, a negative regulator element, a hormone responsive 
element, an avian CR1 repeat element that together may constitute, in whole or in 
part, the ovalbumin transcriptional regulatory region, a proximal ovalbumin promoter 
and a signal peptide-encoding region. There are also stretches of nucleic acid 
between these constituent elements that organize the various elements into an ordered 

1 5 linear array. While the constituent elements of the ovalbumin transcriptional control 
region are preferably ordered as in sequence SEQ ID NO: 1, it is within the scope of 
the present invention for the cis-elements of the ovalbumin transcriptional regulatory 
region to be in any linear arrangement that will allow the formation of a transcript 
comprising the nucleotide sequence, or its complement, of a nucleic acid insert 

20 operably linked to the ovalbumin transcriptional regulatory region. 

The novel isolated nucleic acid molecules of the present invention allow one 
skilled in the art to, for example, (a) make copies of those nucleic acid molecules by 
procedures such as, but not limited to, insertion into a cell for replication by the cell, 
by chemical synthesis or by procedures such as PGR or LCR, (b) obtain nucleic acid 

25 molecules which include at least a portion of such nucleic acid molecules, including 
full-length genes, full-length coding regions, transcriptional regulatory sequences, 
truncated coding regions and the like, (c) identify and obtain ovalbumin 
transcriptional regulatory region homologs found in other avian species such as, but 
not limited to, turkey, duck, goose, quail, pheasant, parrot, finch, ratites including 

30 ostrich, emu and cassowary and, (d) to obtain isolated nucleic acids capable of 
hybridizing to an avian ovalbumin transcriptional regulatory region nucleic acid and 
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of being used as a probe to detect the presence of nucleic acid-related sequences by 
complementation between the probe and the target nucleic acid. 

Such nucleic acid homologs can be obtained in a variety of ways including 
using traditional cloning techniques to screen appropriate libraries, amplifying 
5 appropriate libraries or DNA using oligonucleotide primers derived from the novel 
nucleic acid molecules of the present invention in a polymerase chain reaction or 
other amplification method, and screening public and/or private databases containing 
genetic sequences using nucleic acid sequences of the present invention to identify 
targets. Examples of preferred libraries to screen, or from which to amplify nucleic 
10 acid molecules, include but are not limited to avian BAC libraries, genomic DNA 
libraries, and cDNA libraries. Similarly, preferred sequence databases useful for 
screening to identify sequences in other species homologous to chicken ovalbumin 
transcriptional regulatory region include, but are not limited to, GenBank and the 
mammalian Gene Index database of The Institute of Genomics Research (TIGR). 
1 5 Nucleotides used to construct the nucleic acids of the present invention can be 

labeled to provide a signal as a means of detection, using conventional labeling 
technologies such, as radioactive labels, fluorescent compounds, enzymes and 
chemiluminescent moieties. Methods useful in selecting appropriate labels and 
binding protocols for binding the labels to the synthetic nucleotides are well known to 
20 those of skill in the art. 

In one embodiment of the isolated nucleic acid molecule according to the 
present invention, the nucleic acid is isolated from a chicken. 

In other embodiments of the isolated nucleic acid molecule according to the 
present invention, the nucleic acid molecule comprises a nucleotide sequence having 
25 at least 80% identity, at least 95% identity or at least 99% identity to the nucleotide 
sequence according to SEQ ID NO: 1, or the complement thereof. 

In other embodiments, the isolated nucleic acid molecule of the invention 
comprises the nucleotide sequence according to SEQ ID NO: 1 or has the nucleotide 
sequence according to SEQ ID NO: 1. In another embodiment, the isolated nucleic 
30 acid molecule can be an allelic variant of SEQ ID NO: 1. 
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(b) Fragments and Variants of SEQ ID NO: 1 

Fragments of the isolated nucleic acid molecules of the present invention also 
are within the scope of the present invention. As used herein, a fragment of a nucleic 
acid molecule refers to a nucleotide sequence having fewer nucleotides than the 
5 nucleotide sequence SEQ ID NO: 1 but which includes a nucleic acid sequence of the 
ovalbumin transcriptional regulatory region able to direct and regulate transcription of 
a nucleic acid, and at least one MAR element. 

The isolated nucleic acid molecule having the sequence SEQ ID NO: 1 may 
be reduced in size by truncating regions that do not affect the expression of a 

10 heterologous nucleic acid placed under the transcriptional control of the ovalbumin 
transcription regulatory region. A truncated variant of the nucleic acid molecule of 
the present invention is understood to be any variant of SEQ ID NO: 1 less 
nucleotides at either the 5' and/or the 3' end of SEQ ID NO: 1. For example, it is 
contemplated that any of the nucleotides from positions 1- about 40500 may be 

15 individually, in part, or in total, deleted from the variant nucleic acid molecule. 
Similarly, nucleotides from positions about 151700, 164200, 186690 or 190900 to 
195101 of the nucleic acid molecule having sequence SEQ ID NO: 1 may be 
removed, retaining 1, 2, 3, 4 or more of the MAR or MAR-like elements respectively 
located 3 1 at the chicken ovalbumin gene. Useful truncated variants of SEQ ID NO: 

20 1, therefore, include, but are not limited to, from base position about 41000 to about 
191500, to about 187000, to about 164500, to about 152000, or to about 145500 and 
from base position about 96000 to about 191500, to about 187000, to about 164500, 
to about 152000 or to about 145500. Other useful truncated variants of SEQ ID NO: 
1 include regions from nucleotide positions about 56000, about 58350, about 76200 

25 and about 80000 to about 191500, to about 187000, to about 164500, to about 152000 
or to about 145500. 

Therefore, the invention encompasses nucleic acid molecules which do not 
include regions that do not contribute to the desired functionality of inserting a 
heterologous nucleic acid into an avian genome with reduced or no chromosomal 
30 positioning effect. The region 5' upstream of the MAR located at nucleotide positions 
41701-41900 of SEQ ID NO: 1, may be deleted to give a truncated variant of SEQ ID 
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NO: 1. For example, the approximately 11.5 kb region extending from nucleotide 
position 1 of SEQ ID NO: 1 not present in BAG 26 may be deleted. Likewise, it is 
contemplated that other regions of SEQ ID NO: 1 as listed in Table 2, such as 
encoding the serpin-like proteins, may be selectively deleted. 
5 Recombinant nucleic acids 

Another aspect of the present invention is recombinant nucleic acid molecules 
comprising at least one, and preferably at least two, avian MARs and an avian 
ovalbumin transcription regulatory region, including the proximal promoter thereof. 
The recombinant nucleic acid molecules of the present invention are particularly 

10 useful for delivering a desired heterologous nucleic acid to a recipient avian cell 
while reducing chromosomal positional effects upon transcription from the integrated 
heterologous nucleic acid. It is contemplated that regions of SEQ ID NO: 1 may be 
omitted from the recombinant nucleic acid molecules of the present invention without 
substantially affecting the reduction in the CPE compared to a similar nucleic acid 

15 molecule not including MAR elements. For example, one or more of the serpin- 
encoding regions of SEQ ID NO: 1 listed in Table 2, below, may not be included. 

The present invention, therefore, provides recombinant nucleic acid molecules 
that comprise at least one avian MAR and an avian ovalbumin transcription 
regulatory region optionally operably linked in a linear array to a selected 

20 heterologous or endogenous polypeptide-encoding nucleic acid insert, and which may 
express the nucleic acid insert when transfected to a suitable host cell, preferably an 
avian cell. 

The nucleic acid insert, such as a heterologous nucleic acid can be operably 
linked 3' downstream of the ovalbumin proximal promoter and is thereby expressed 

25 as an RNA transcript by a transfected recipient cell. The heterologous nucleic acid 
may be inserted into the recombinant nucleic acid of the present invention 3' 
downstream of a region encoding a peptide leader region so that a heterologous 
polypeptide encoded by the inserted nucleic acid may include this leader region. It is 
within the scope of the present invention for the recombinant nucleic acid to have the 

30 nucleic acid insert encoding the desired polypeptide to be operably inserted into the 
ovalbumin coding region, or operably replacing the ovalbumin coding region in whole 
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or in part. The generation of BACs comprising a heterologous nucleic acid under the 
transcriptional control of the ovalbumin gene control region according to the present 
invention are described in Examples 2 and 3, below. 

To increase the efficiency of expression of the heterologous nucleic acid 
5 insert, a polyadenylation signal region may be included at the 3' end of the inserted 
nucleic acid to allow the transcript directed by the novel ovalbumin transcriptional 
regulatory region to proceed beyond the nucleic acid insert encoding a selected 
polypeptide thereby providing a transcript further comprising a 3 ! untranslated region 
and a polyadenylated tail. Any suitable functional polyadenylation signal sequence 

10 may be linked to the 3 ! end of the nucleic acid insert including, for example, the SV40 
polyadenylation signal sequence SEQ ID NO: 2 as shown in Fig. 4, bovine growth 
hormone adenylation sequence or the like. It is further anticipated that the 
recombinant nucleic acid molecules of the present invention may comprise the 
chicken ovalbumin 3' domain, or a variant thereof. The ovalbumin 3' domain may 

15 comprise the ovalbumin 3' untranslated region, an ovalbumin gene polyadenylation 
sequence and at least one of the 3' MAR elements identified downstream of the 
ovalbumin-encoding region of SEQ ID NO: 1. If the heterologous nucleic acid is 
inserted within the ovalbumin encoding region, and in-phase with the ovalbumin 
gene, the polyadenylation signal region of the ovalbumin gene may be used. 

20 In one embodiment of the recombinant nucleic acid molecule according to the 

present invention, the recombinant nucleic acid molecule comprises the nucleotide 
sequence according to SEQ ID NO: 1, or the complement thereof. 

Another aspect of the present invention is a recombinant DNA molecule 
comprising a MAR element and an avian ovalbumin transcriptional regulatory region. 

25 In one embodiment, the ovalbumin transcriptional regulatory region is operably 
linked in linear array to a nucleic acid insert encoding a polypeptide sought to be 
expressed, and a polyadenylation signal sequence optionally operably linked thereto. 
It is contemplated that when the recombinant nucleic acid molecule is to be delivered 
to a recipient avian cell for expression therein, the sequence of the inserted 

30 heterologous nucleic acid sequence may be modified so that the codons thereof are 
optimized for the codon usage of the recipient species as described below. In a 
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preferred embodiment, a MAR element is located 5 f upstream of the ovalbumin 
transcriptional regulatory region. Suitable MAR elements, for example, are located at 
about nucleotide positions about 41701-41800 and about 96401-96800 of sequence 
SEQ ID NO: 1. 

5 In one embodiment of the present invention, the recombinant nucleic acid 

molecule comprises the nucleotide sequence from nucleotides position about 40750 
to 195101 of SEQ ID NO: 1. Various embodiments of the recombinant nucleic acid 
molecules of the present invention comprise a 5' MAR and/or a 3' MAR, and the 
ovalbumin transcriptional regulatory region. In one embodiment, the recombinant 

10 nucleic acid further comprises a T gene. In others, the recombinant nucleic acid 
further comprises at least one nucleic acid region selected from the group consisting 
of the U serpin gene, a V serpin gene, an X gene, a Y gene and a Z serpin derived 
from SEQ ID NO: 1. 

Another embodiment of the recombinant nucleic acid molecules further 

15 comprises at least one avian MAR 3* downstream of the nucleic acid insert. Suitable 
MAR elements for inclusion 3' downstream of the ovalbumin transcriptional 
regulatory region of the recombinant construct of the present invention are found at 
nucleotide positions about 144651-144850, about 150800-151600, about 163701- 
164100, about 186201-186590 and about 190101-190800 of sequence SEQ ID NO: 1. 

20 In one embodiment of the recombinant nucleic acid molecules, the ovalbumin 
transcriptional regulatory region, the avian 5' MAR, and the avian 3' MAR are 
independently capable of hybridizing under high stringency conditions to the nucleic 
acid sequence according to SEQ ID NO: 1, or the complement thereof. 

In various embodiments of the present invention, the recombinant nucleic acid 

25 molecule is inserted into a vector such as, but not limited to, a plasmid or viral vector. 

Other embodiments of the recombinant nucleic acid molecules further 
comprise a plasmid or viral origin of replication. In one embodiment, the 
recombinant nucleic acid molecule is a bacterial or yeast artificial chromosome. 

Yet another embodiment of the recombinant nucleic acid molecule according 

30 to the present invention, therefore, is a recombinant nucleic acid molecule comprising 
an avian ovalbumin transcription regulatory region, an avian 5' MAR, a heterologous 
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nucleic acid encoding a heterologous polypeptide desired to be expressed by a 
recipient genetically modified cell, a polyadenylation signal sequence, and an avian 3' 
MAR, wherein the avian ovalbumin transcription regulatory region, 5' MAR, and the 
3' MAR each independently hybridizes under high stringency conditions to the 
5 nucleic acid sequence SEQ ID NO: 1, or a complement thereof. 

Polypeptide expression under the control of an avian ovalbumin promoter 

Another aspect of the present invention of the novel isolated ovalbumin 
transcriptional regulatory region is increasing the amount of a heterologous protein 
present in a bird (especially the chicken) by gene transfer. Typically, a heterologous 

10 polypeptide-encoding nucleic acid insert transferred into the recipient animal host 
will be operably linked with the ovalbumin transcriptional regulatory region to allow 
the cell to initiate and continue production of the genetic product protein. A 
recombinant DNA molecule of the present invention can be transferred into the extra- 
chromosomal or genomic DNA of the host. 

15 A useful application of the novel isolated and recombinant nucleic acid 

molecules of the present invention is to increase the amount of a heterologous protein 
present in a bird, (especially the chicken) by gene transfer. Typically, a heterologous 
polypeptide-encoding nucleic acid insert transferred into the recipient bird host or an 
isolated cell or cell-line from the bird will be operably linked with the ovalbumin 

20 transcriptional regulatory region to allow the cell to initiate and continue production 
of the genetic protein product. 

The isolated nucleic acid molecule SEQ ID NO: 1 is useful for inserting 
therein a heterologous nucleic acid that is desired to be expressed as a transcript or, 
ultimately, as a polypeptide. A heterologous nucleic acid may be operably linked to 

25 the proximal promoter region of the ovalbumin gene at any position 3 f downstream of 
the promoter that allows transcription from the heterologous nucleic acid and 
synthesis of the desired encoded peptide. Some, or all, of the ovalbumin-encoding 
region of the isolated or recombinant nucleic acids of the present invention may be 
replaced by a heterologous nucleic acid to be expressed under the transcriptional 

30 control of upstream ovalbumin gene control region. The heterologous nucleic acid 
may be inserted into the isolated or recombinant nucleic acids of the present invention 
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so that the expressed amino acid sequences derived from the ovalbumin may be 
linked to the expressed heterologous protein either at the N-terminus or C-terminus 
thereof. 

Any of the vectors of the present invention may also optionally include a 
5 sequence encoding a signal peptide that directs secretion of the protein expressed by 
the vector from the transgenic cells, for instance, from tubular gland cells of the 
oviduct. This aspect of the invention effectively broadens the spectrum of exogenous 
proteins that may be deposited in avian eggs using the methods of the invention. 
Where an exogenous protein would not otherwise be secreted, the vector bearing the 
10 coding sequence is modified to comprise, for instance, about 60 bp encoding a signal 
peptide. The DNA sequence encoding the signal peptide is inserted in the vector such 
that the signal peptide is located at the N-terminus of the protein encoded by the 
vector. 

The expression vectors of the present invention comprise avian ovalbumin 

1 5 transcriptional regulatory regions that can direct expression of either fusion or non- 
fusion proteins. With fusion vectors, a number of amino acids are usually added to 
the desired expressed target gene sequence such as, but not limited to, a protein 
sequence for thioredoxin. A proteolytic cleavage site may further be introduced at a 
site between the target recombinant protein and the fusion sequence. Additionally, a 

20 region of amino acids such as a polymeric histidine region may be introduced to 
allow binding of the fusion protein to metallic ions such as nickel bonded to a solid 
support, for purification of the fusion protein. Once the fusion protein has been 
purified, the cleavage site allows the target recombinant protein to be separated from 
the fusion sequence. Enzymes suitable for use in cleaving the proteolytic cleavage 

25 site include, but are not limited to, Factor Xa and thrombin. Fusion expression 
vectors that may be useful in the present invention include pGex (Amrad Corp., 
Melbourne, Australia), pRIT5 (Pharmacia, Piscataway, NJ) and pMAL (New England 
Biolabs, Beverly, MA), that fuse glutathione S -transferase, protein A, or maltose E 
binding protein, respectively, to the target recombinant protein. 

30 The present invention further relates to nucleic acid vectors and transgenes 

derived therefrom that incorporate polypeptide-encoding regions, wherein a first 
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polypeptide-encoding region is operatively linked to an avian ovalbumin promoter 
and a second polypeptide-encoding region is operatively linked to an Internal 
Ribosome Entry Sequence (IRES). It is contemplated that the first polypeptide- 
encoding region, the IRES and the second polypeptide-encoding region of a 
5 recombinant DNA of the present invention may be arranged linearly, with the IRES 
operably positioned immediately 5' of the second polypeptide-encoding region. This 
nucleic acid construct, when inserted into the genome of a bird and expressed therein, 
will generate individual polypeptides that may be post-translationally modified and 
combined in the white of a hard-shell bird egg. Alternatively, the expressed 

10 polypeptides may be isolated from an avian egg and combined in vitro. 

Expression of a heterologous nucleic acid by a recombinant expression vector 
according to the present invention can be obtained using eukaryotic host cells, 
preferably avian cells, more preferably chicken cells, and still more preferably 
chicken oviduct cells, especially tubular gland cells. The use of eukaryotic host cells 

15 permit partial or complete post-translational modification such as, but not only, 
glycosylation and/or the formation of the relevant inter- or intra-chain disulfide 
bonds. Examples of vectors useful for expression in the chicken Gallus gallus 
include pYepSecl as in Baldari et al 9 E.M.B.O.J., 6, 229-234 (1987) and pYES2 
(Invitrogen Corp., San Diego, CA), incorporated herein by reference in their 

20 entireties. 

One aspect of the present invention is methods of delivering a novel nucleic 
acid molecule of the present invention to the cytoplasm of an avian cell having a 
nucleus, thereby generating a transfected and genetically transformed avian cell. 
Such incorporation can be carried out by the various forms of transfection, depending 

25 upon the vector/host cell system. It is contemplated that the incorporation of 
recombinant nucleic acid molecules of the present invention into a recipient cell may 
be by any suitable method such as, but not limited to, viral transfer, electroporation, 
gene gun insertion, sperm-mediated transfer to an ovum, microinjection and the like. 

In the various embodiments of these methods, the avian cell may be a chicken 

30 cell or a quail cell. In some embodiments of the methods of the present invention, the 
avian cell is within oviductal tissue of a bird, an isolated oviduct cell or primary cell 
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line, or a sustainable oviduct cell line. Preferably, the oviduct cells are tubular gland 
cells. 

Heterologous polypeptide can be produced by transfected cells of the 
invention in vitro, i.e., in tissue culture outside the body of a living animal. 
5 Alternatively, the nucleic acids of the present invention may be delivered to an animal 
such as a chicken, whereupon the nucleic acid may enter cells and be expressed 
therein. It is anticipated that the nucleic acids of the present invention may integrate 
into the genome of the recipient cells and then express the encoded, typically 
heterologous, polypeptide therein. Preferably, a heterologous nucleic acid is 

10 delivered to oviduct cells within a chicken for synthesis of the desired polypeptide 
and its deposition in the white of an egg. 

Another aspect of the present invention is a eukaryotic cell transfected with an 
expression vector according to the present invention and described above. For 
example, in one embodiment, the transformed cell can be a chicken oviduct cell or 

15 cell line, including a sustainable cell line, and the transfected nucleic acid insert 
comprises the chicken ovalbumin transcriptional regulatory region, a 5' MAR and/or a 
3' MAR, a nucleic acid insert encoding a human interferon a2b and codon optimized 
for expression in an avian cell, and an SV40 polyadenylation sequence. In another 
example, the nucleic acid insert encodes an immunoglobulin heavy chain and a 

20 second chain under the transcriptional control of an IRES. 

The transfected cell according to the present invention may be transiently 
transfected, whereby the transfected recombinant nucleic cid, such as DNA, or 
expression vector may not be integrated into the genomic nucleic acid. However, the 
transfected recombinant DNA or expression vector may be stably integrated into the 

25 genomic DNA of the recipient cell, thereby replicating with the cell so that each 
daughter cell receives a copy of the transfected nucleic acid. When the recombinant 
DNA or expression vector of the present invention is integrated into the genomic 
DNA of the recipient cell so that the cell is genetically transformed, it is anticipated 
that the MAR element(s) of the integrated nucleic acid will direct integration a 

30 limited number of integration site within the target genome, thereby producing a 
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population of cells more uniform with regard to the level of expression of the 
heterologous nucleic acid. 

The the present invention also includes a transgenic bird producing a 
heterologous protein expressed from a transfected nucleic acid according to the 
5 present invention. The transgenic bird is selected from a turkey, duck, goose, quail, 
pheasant, ratite, an ornamental bird or a feral bird. In a preferred embodiment, the 
avian is a chicken and the heterologous protein produced under the transcriptional 
control of the avian ovalbumin transcriptional regulatory region according to the 
present invention is produced in the white of an egg. 

10 Viral host cell transformation 

Nucleic acid sequences or derivative or truncated variants thereof, may be 
introduced into viruses such as an adenovirus or vaccinia virus. Methods for making 
a viral recombinant vector useful for expressing a protein under the control of the 
ovalbumin promoter are analogous to the methods disclosed in U.S. Patent Nos. 

15 4,603,112; 4,769,330; 5,174,993; 5,505,941; 5,338,683; 5,494,807; 4,722,848; 
Paolettu E. , 1996, Proc. Natl Acad, Set 93: 11349-11353; Moss . 1996, Proc. Natl 
Acad. Set 93: 11341-11348; Roizman , 1996, Proc. Natl. Acad. Set 93: 11307-11302; 
Frolov et aL , 1996, Proc. Natl. Acad. Set 93: 11371-11377; Grunhaus et al, 1993, 
Seminars in Virology 3: 237-252 and U.S. Patent Nos. 5,591,639; 5,589,466; and 

20 5,580,859 relating to DNA expression vectors, inter alia, the contents of which are 
incorporated herein by reference in their entireties. 

Retrovirus vectors and adeno-associated virus vectors provide efficient 
systems of delivery of genes into cells, and the transferred nucleic acids may be 
stably integrated into the chromosomal DNA of the host. Protocols for producing 

25 recombinant retroviruses and for infecting cells in vitro or in vivo with such viruses 
can be found in Ausubel et aL , 1989, Current Protocols in Molecular Biology §§ 
9.10-9.14 and other standard laboratory manuals. Examples of suitable retroviruses 
include pLJ, pZIP, pWE and pEM which are well known to those skilled in the art. 
Examples of suitable packaging virus lines for preparing both ecotropic and 

30 ampho tropic retroviral systems include psiCrip, psiCre, psi2 and psiAm. 
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Furthermore, it is possible to limit the infection spectrum of retroviruses and 
consequently of retroviral-based vectors, by modifying the viral packaging proteins 
on the surface of the viral particle (see, for example PCT publications WO 93/25234, 
WO 94/06920, and WO 94/11524). Roux et aL , 1898, Proc, Natl Acad, Set 
5 86:9079-9083; Julan et aL . 1992, J. Gen. Virol 73:3251-3255; and Goud et aL , 1983, 
Virology 163:251-254); Neda et aL , 1991, J. Biol Chem. 266:14143-14146), which 
are incorporated herein by reference in their entireties. 

One retrovirus for randomly introducing a transgene into the avian genome is 
a replication-deficient ALV retrovirus. To produce an appropriate ALV retroviral 

1 0 vector, a pNLB vector may be modified by inserting a region comprising at least part 
of the ovalbumin transcriptional regulatory region, a MAR element and one or more 
exogenous genes between the 5' and 3' long terminal repeats (LTRs) of the retrovirus 
genome. Any coding sequence placed in-frame and downstream of the ovalbumin 
promoter will be expressed at high levels and especially in the tubular gland cells of 

15 the oviduct magnum because the ovalbumin promoter drives the high level of 
expression of the ovalbumin protein and is only active in the oviduct tubular gland 
cells. 

Another viral gene delivery system useful in the present invention utilizes 
adenovirus-derived vectors (see, for example, Berkner et aL , 1988, BioTechniques 

20 6:616-629; Rosenfeld et aL , 1991, Science 252:431-434; and Rosenfeld et aL, 1992, 
Cell 68:143-155), incorporated herein by reference in their entireties. Suitable 
adenoviral vectors derived from the adenovirus strain Ad type 5 dl324 or other strains 
of adenovirus (e.g., Ad2, Ad3, Ad7 etc.) are well known to those skilled in the art. 
Introduced adenoviral DNA (and foreign DNA contained therein) is not integrated 

25 into the genome of a host cell but remains episomal, thereby avoiding potential 
problems that can occur as a result of insertional mutagenesis in situations where 
introduced DNA becomes integrated into the host genome (e.g., retroviral DNA). 

Yet another viral vector system is the adeno-associated virus (AAV). Vectors 
containing as little as 300 base pairs of AAV can be packaged and can integrate. In 

30 the present invention, at least part of the heterologous nucleic acid will include an 
operable region of the avian ovalbumin transcriptional regulatory region and a MAR 
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element. An AAV vector such as that described in Tratschin et al.< 1985, Mol. Cell 
Biol 5:3251-3260, can be used to introduce DNA into cells. 

Other viral vector systems that may have application in the methods according 
to the present invention have been derived from, but are not limited to, herpes viruses, 
5 vaccinia viruses, avian leucosis viruses and several RNA viruses. 
Non-viral expression vectors 

Most non-viral methods of gene transfer rely on normal mechanisms used by 
eukaryotic cells for the uptake and intracellular transport of macromolecules. In 
preferred embodiments, non-viral gene delivery systems of the present invention rely 

10 on endocytic pathways for the uptake of the subject ovalbumin transcriptional 
regulatory region and operably linked polypeptide-encoding nucleic acid by the 
targeted cell. Exemplary gene delivery systems of this type include liposomal derived 
systems, poly-lysine conjugates, and artificial viral envelopes. 

In a representative embodiment, a nucleic acid comprising the novel 

15 recombinant nucleic acids of the present invention can be entrapped in liposomes 
bearing positive charges on their surface (e.g., lipofectins) and (optionally) which are 
tagged with antibodies against cell surface antigens of the target tissue (Mizuno et al , 
1992, NO Shinkei Geka 20: 547-551; PCT publication WO91/06309; Japanese patent 
application 1047381; and European patent publication EP-A-43075, all of which are 

20 incorporated herein by reference in their entireties). 

In similar fashion, the gene delivery system comprises an antibody or cell 
surface ligand that is cross-linked with a gene binding agent such as polylysine (see, 
for example, PCT publications WO93/04701, W092/22635, WO92/20316, 
W092/ 19749, and WO92/06180, all of which are incorporated herein by reference in 

25 their entireties). It will also be appreciated that effective delivery of the subject 
nucleic acid constructs via receptor-mediated endocytosis can be improved using 
agents which enhance escape of genes from the endosomal structures. For instance, 
whole adenovirus or fusogenic peptides of the influenza HA gene product can be used 
as part of the delivery system to induce efficient disruption of DNA-containing 

30 endosomes ( Mulligan et gL 9 1993, Science 260-926; Wagner et al „ 1992, Proc. Natl 
Acad, Set 89:7934-7938; and Christiano et al , 1993, Proc. Natl Acad. Sci. 90:2122- 
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2126, all of which are incorporated herein by reference in their entireties). It is 
further contemplated that a recombinant DNA molecule of the present invention may 
be delivered to a recipient host cell by other non-viral methods including by gene gun, 
microinjection, sperm-mediated transfer, or the like. 
5 Another aspect of the present invention is a method of expressing a 

heterologous polypeptide in a eukaryotic cell by transfecting a cell with a 
recombinant nucleic acid molecule of the invention, as described above, and culturing 
the transfected cell under conditions suitable for expression of the heterologous 
polypeptide under the control of the avian ovalbumin transcriptional regulatory 
10 region. 

In one embodiment of this aspect, the nucleic acid molecule is integrated into 
the genome of the recipient avian cell. In some embodiments the recipient avian 
oviduct cell is a chicken cell, preferably a chicken oviduct cell, more preferably an 
oviduct tubular gland cell. 

1 5 The protein of the present invention may be produced in purified form by any 

known conventional techniques. For example, chicken cells, an egg or an egg white 
may be homogenized and centrifuged. The supernatant may then be subjected to 
sequential ammonium sulfate precipitation and heat treatment. The fraction 
containing the protein of the present invention is subjected to gel filtration in an 

20 appropriately sized dextran or polyacrylamide column to separate the proteins. If 
necessary, the protein fraction may be further purified by HPLC or other methods 
well known in the art of protein purification. 

Expression of heterologous multimeric proteins by transfected avian cells 

The present invention provides methods for the production of a multimeric 

25 protein by an avian cell, comprising the step of culturing an avian cell transfected 
with a first expression vector and, optionally, a second expression vector; the 
expression vectors may each have a transcription unit comprising a nucleotide 
sequence encoding a first heterologous polypeptide, a transcription promoter, and a 
transcriptional terminator operatively linked to the nucleotide sequence encoding a 

30 second heterologous polypeptide, such that the cultured avian cell produces a 
multimeric protein comprising the first and second heterologous polypeptides. 
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The isolated nucleic acids and recombinant nucleic acid constructs derived 
therefrom of the present invention are useful to express nucleic acid sequences of 
polypeptides that are optimized for expression in avian cells, and derivatives and 
fragments thereof. Such derivatives include, for instance, polypeptides with 
5 conservative amino acid replacements, that is, those within a family of amino acids 
that are related in their side chains (commonly known as acidic, basic, nonpolar, and 
uncharged polar amino acids). Phenylalanine, tryptophan, and tyrosine are sometimes 
classified jointly as aromatic amino acids and other groupings are known in the art 
(see, for example, "Biochemistry", 2nd ed, L. Stryer, ed., WH Freeman and 
10 Co., 1981). Peptides in which more than one replacement has taken place can readily 
be tested for activity in the same manner as derivatives with a single replacement, 
using conventional polypeptide activity assays (e.g. for enzymatic or ligand binding 
activities). 

Regarding codon optimization, for example, if the recombinant DNA is 
1 5 transfected into a recipient chicken cell, the sequence of the nucleic acid insert to be 
expressed is optimized for chicken codon usage. This may be determined from the 
codon usage of at least one, and preferably more than one, protein expressed in a 
chicken cell according to well known principles. For example, in the chicken the 
codon usage may be determined from the nucleic acid sequences encoding the 
20 proteins lysozyme, ovalbumin, ovomucin and ovotransferrin of chicken. 
Optimization of the sequence for codon usage elevates the level of translation in 
avian eggs. 

One embodiment of the recombinant nucleic acid of the present invention, 
comprises an insert encodes the human interferon a2b polypeptide. The exemplary 
25 nucleic acid sequence SEQ ID NO: 3 (Fig. 5) encodes the polypeptide human 
interferon a2b in accordance with avian cell codon usage, as determined from the 
nucleotide sequences encoding chicken ovomucin, ovalbumin, ovotransferrin and 
lysozyme. 

The invention methods for producing multimeric proteins include 
30 immunoglobulins, such as antibodies, and antigen binding fragments thereof. Thus, 
in one embodiment of the present invention, the multimeric protein is an 



-36- 



WO 2004/094640 



PCT/US2003/039244 



immunoglobulin, wherein the first and second heterologous polypeptides are an 
immunoglobulin heavy and light chains respectively. Illustrative examples of this and 
other aspects of the present invention for the production of heterologous multimeric 
polypeptides in avian cells are fully disclosed in U.S. Patent Application No. 
5 09/877,374, filed June 8, 2001, by Rapp, published as US-2002-0108132-AI on 
August 8, 2002, and U.S. Patent Application No. 10/251,364, filed September 18, 
2002, by Rapp, both of which are incorporated herein by reference in their entirety. 

Accordingly, the invention further provides immunoglobulin and other 
multimeric proteins that have been produced by transgenic avians of the invention. 

10 In various embodiments, an immunoglobulin polypeptide encoded by the 

transcriptional unit of at least one expression vector may be an immunoglobulin 
heavy chain polypeptide comprising a variable region or a variant thereof, and may 
further comprise a D region, a J region, a C region, or a combination thereof. An 
immunoglobulin polypeptide encoded by an expression vector may also be an 

15 immunoglobulin light chain polypeptide comprising a variable region or a variant 
thereof, and may further comprise a J region and a C region. The present invention 
also contemplates multiple immunoglobulin regions that are derived from the same 
animal species, or a mixture of species including, but not only, human, mouse, rat, 
rabbit and chicken. In preferred embodiments, the antibodies are human or 

20 humanized. 

In other embodiments, the immunoglobulin polypeptide encoded by at least 
one expression vector comprises an immunoglobulin heavy chain variable region, an 
immunoglobulin light chain variable region, and a linker peptide thereby forming a 
single-chain antibody capable of selectively binding an antigen. 

25 Another aspect of the present invention provides a method for the production 

in an avian of an heterologous protein capable of forming an antibody suitable for 
selectively binding an antigen. This method comprises a step of producing a 
transgenic avian incorporating at least one transgene, wherein the transgene encodes 
at least one heterologous polypeptide selected from an immunoglobulin heavy chain 

30 variable region, an immunoglobulin heavy chain comprising a variable region and a 
constant region, an immunoglobulin light chain variable region, an immunoglobulin 
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light chain comprising a variable region and a constant region, and a single-chain 
antibody comprising two peptide-linked immunoglobulin variable regions. 

In one embodiment of this method, the isolated heterologous protein is an 
antibody capable of selectively binding to an antigen which may be generated by 
5 combining at least one immunoglobulin heavy chain variable region and at least one 
immunoglobulin light chain variable region, preferably cross-linked by at least one 
disulfide bridge. The combination of the two variable regions generates a binding site 
that binds an antigen using methods for antibody reconstitution that are well known in 
the art. 

10 The present invention also encompasses immunoglobulin heavy and light 

chains, or variants or derivatives thereof, to be expressed in separate transgenic 
avians, and thereafter isolated from separate media including serum or eggs, each 
isolate comprising one or more distinct species of immunoglobulin polypeptide. The 
method may further comprise the step of combining a plurality of isolated 

15 heterologous immunoglobulin polypeptides, thereby producing an antibody capable 
of selectively binding to an antigen. In this embodiment, for instance, two or more 
individual transgenic avians may be generated wherein one transgenic produces 
serum or eggs having an immunoglobulin heavy chain variable region, or a 
polypeptide comprising such, expressed therein. A second transgenic animal, having 

20 a second transgene, produces serum or eggs having an immunoglobulin light chain 
variable region, or a polypeptide comprising such, expressed therein. The 
polypeptides from two or more transgenic animalsmay be isolated from their 
respective sera and eggs and combined in vitro to generate a binding site capable of 
binding an antigen. 

25 Examples of therapeutic antibodies that can be used in methods of the 

invention include but are not limited to HERCEPTIN® (Trastuzumab) (Genentech, 
CA) which is a humanized anti-HER2 monoclonal antibody for the treatment of 
patients with metastatic breast cancer; REOPRO® (abciximab) (Centocor) which is 
an anti-glycoprotein Ilb/IIIa receptor on the platelets for the prevention of clot 

30 formation; ZENAPAX® (daclizumab) (Roche Pharmaceuticals, Switzerland) which 
is an immunosuppressive, humanized anti-CD25 monoclonal antibody for the 
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prevention of acute renal allograft rejection; PANOREX™ which is a murine anti-17- 
IA cell surface antigen IgG2a antibody (Glaxo Wellcome/Centocor); BEC2 which is a 
murine anti-idiotype (GD3 epitope) IgG antibody (ImClone System); IMC-C225 
which is a chimeric anti-EGFR IgG antibody (ImClone System); VITAXIN™ which 
5 is a humanized anti-aVj33 integrin antibody (Applied Molecular 
Evolution/Medlmmune); Campath 1H/LDP-03 which is a humanized anti CD52 IgGl 
antibody (Leukosite); Smart Ml 95 which is a humanized anti-CD33 IgG antibody 
(Protein Design Lab/Kanebo); RITUXAN™ which is a chimeric anti-CD2Q IgGl 
antibody (IDEC Pharm/Genentech, Roche/Zettyaku); LYMPHOCIDE™ which is a 

10 humanized anti-CD22 IgG antibody (Immunomedics); ICM3 is a humanized anti- 
ICAM3 antibody (ICOS Pharm); IDEC-1 14 is a primatied anti-CD80 antibody (IDEC 
Pharm/Mitsubishi); ZEVALIN™ is a radiolabelled murine anti-CD20 antibody 
(IDEC/Schering AG); IDEC-1 31 is a humanized anti-CD40L antibody (IDEC/Eisai); 
IDEC-1 51 is a primatized anti-CD4 antibody (IDEC); IDEC-1 52 is a primatized anti- 

15 CD23 antibody (IDEC/Seikagaku); SMART anti-CD3 is a humanized anti-CD3 IgG 
(Protein Design Lab); 5G1.1 is a humanized anti-complement factor 5 (CS) antibody 
(Alexion Pharm); D2E7 is a humanized anti-TNF-a antibody (CATIBASF); CDP870 
is a humanized anti-TNF-a Fab fragment (Celltech); IDEC-1 51 is a primatized anti- 
CD4 IgGl antibody (IDEC Pharm/SmithKline Beecham); MDX-CD4 is a human 

20 anti-CD4 IgG antibody (Medarex/Eisai/Genmab); CDP571 is a humanized anti-TNF- 
a IgG4 antibody (Celltech); LDP-02 is a humanized anti-a4p7 antibody 
(LeukoSite/Genentech); OrthoClone OKT4A is a humanized anti-CD4 IgG antibody 
(Ortho Biotech); ANTOVA™ is a humanized anti-CD40L IgG antibody (Biogen); 
ANTEGREN™ is a humanized anti-VLA-4 IgG antibody (Elan); and CAT- 152 is a 

25 human anti-TGF-p 2 antibody (Cambridge Ab Tech). 

Production of Exogenous Protein by Transgenic Avians 

Methods for the production of heterologous protein by the avian oviduct and 
the production of eggs which contain heterologous protein involve providing a 
suitable vector and introducing the vector into embryonic blastodermal cells so that 

30 the vector can integrate into the avian genome. A subsequent step involves deriving a 
mature transgenic avian from the transgenic blastodermal cells produced in the 
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previous steps. Deriving a mature transgenic avian from the blastodermal cells 
optionally involves transferring the transgenic blastodermal cells to an embryo and 
allowing that embryo to develop fully, so that the cells become incorporated into the 
bird as the embryo is allowed to develop. Another alternative is to transfer the 
5 transfected nucleus to an enucleated recipient cell which may then develop into a 
zygote and ultimately an adult bird. The resulting chick is then grown to maturity. 

In an alternative embodiment, the cells of a blastodermal embryo are 
transfected or transduced with the vector directly within the embryo. It is 
contemplated, for example, that the recombinant nucleic acid molecules of the 

10 present invention may also be introduced into a blastodermal embryo by direct 
microinjection of the DNA into a Stage X or earlier embryo that had been removed 
from the oviduct. The egg is then returned to the bird for shell development and 
laying. The resulting embryo is allowed to develop and the chick allowed to mature. 

In either case, the transgenic bird so produced from the transgenic 

1 5 blastodermal cells is known as a "founder" Some founders can be chimeric or mosaic 
birds if, for example, microinjection does not deliver nucleic acid molecules to all of 
the blastodermal cells of an embryo. Some founders will carry the transgene in the 
tubular gland cells in the magnum of their oviducts and will express the exogenous 
protein encoded by the transgene in their oviducts. If the exogenous protein contains 

20 the appropriate signal sequences, it will be secreted into the lumen of the oviduct and 
onto the yolk of an egg. 

Some founders are germ-line founders. A germ-line founder is a founder that 
carries the transgene in genetic material of its germ-line tissue, and may also carry the 
transgene in oviduct magnum tubular gland cells that express the exogenous protein. 

25 Therefore, in accordance with the invention, the transgenic bird will have tubular 
gland cells expressing the exogenous protein and the offspring of the transgenic bird 
will also have oviduct magnum tubular gland cells that express the exogenous protein. 
(Alternatively, the offspring express a phenotype determined by expression of the 
exogenous gene in a specific tissue of the avian.) 

30 The invention can be used to express, in large yields and at low cost, a wide 

range of desired proteins including those used as human and animal pharmaceuticals, 
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diagnostics, and livestock feed additives. Proteins such as growth hormones, 
cytokines, structural proteins and enzymes including human growth hormone, 
interferon, lysozyme, and P-casein are examples of proteins which are desirably 
expressed in the oviduct and deposited in eggs according to the invention. Other 
5 possible proteins to be produced include, but are not limited to, albumin, cc-1 
antitrypsin, antithrombin III, collagen, factors VIII, IX, X (and the like), fibrinogen, 
hyaluronic acid, insulin, lactoferrin, protein C, erythropoietin (EPO), granulocyte 
colony-stimulating factor (G-CSF), granulocyte macrophage colony-stimulating 
factor (GM-CSF), tissue-type plasminogen activator (tPA), feed additive enzymes, 
10 somatotropin, and chymotrypsin. Immunoglobulins and genetically engineered 
antibodies, including immunotoxins which bind to surface antigens on human tumor 
cells and destroy them, can also be expressed for use as pharmaceuticals or 
diagnostics. 

One aspect of the present invention, therefore, concerns transgenic birds, such 

15 as chickens, comprising a recombinant nucleic acid molecule of the present invention 
and which preferably (though optionally) express a heterologous gene in one or more 
cells in the animal. Suitable methods for the generation of transgenic avians having 
heterologous DNA incorporated therein are described, for example, in WO 99/19472 
to Ivarie et al \ WO 00/1 1151 to Ivarie etal .: and WO 00/56932 to Harvey et aL % all 

20 of which are incorporated herein by reference in their entirety. 

In various embodiments of the transgenic bird of the present invention, the 
expression of the transgene may be restricted to specific subsets of cells, tissues or 
developmental stages utilizing, for example, trans-acting factors acting on the 
ovalbumin transcriptional regulatory region of the present invention and which 

25 control gene expression in the desired pattern. Tissue-specific regulatory sequences 
and conditional regulatory sequences can be used to control expression of the 
transgene in certain spatial patterns. Moreover, temporal patterns of expression can 
be provided by, for example, conditional recombination systems or prokaryotic 
transcriptional regulatory sequences. The inclusion of a 5' MAR, and optionally a 3 1 

30 MAR region, in the novel nucleic acid molecules of the present invention will allow 
the heterologous expression unit to escape all, or in part, the chromosomal positional 
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effect and therefore be expressed at a more uniform level in transgenic tissues that 
received the transgene by a route other than through germ line cells. 

In various embodiments of the present invention the transgenic avians 
comprise a recombinant nucleic acid comprising SEQ ID NO: 1, a truncated variant 
5 of SEQ ID NO: 1 , or the complement thereof. 

In one embodiment of the present invention, the transgenic avian is selected 
from the group consisting of a chicken, a turkey, a duck, a goose, a quail, a pheasant, 
a ratite, an ornamental bird or a feral bird. In a preferred embodiment, the avian is a 
chicken. 

10 In various embodiments, the transgenic avian produces the heterologous 

polypeptide in the serum or an egg white, or both. 

The present invention is further illustrated by the following examples, which 
are provided by way of illustration and should not be construed as limiting. The 
15 contents of all references, published patents and patents cited throughout the present 
application are hereby incorporated by reference in their entireties. 

It will be apparent to those skilled in the art that various modifications, 
combinations, additions, deletions and variations can be made in the present invention 
20 without departing from the scope or spirit of the invention. For instance, features 
illustrated or described as part of one embodiment can be used in another 
embodiment to yield a still further embodiment. It is intended that the present 
invention covers such modifications, combinations, additions, deletions and 
variations as come within the scope of the appended claims and their equivalents. 

25 

Example 1: Construction of a complete Ovalbumin locus from two overlapping 

BACs. 

A complete ovalbumin locus BAC was created from two overlapping BACs 
that together contained the complete ovalbumin locus, as shown in Fig. 6. The 
30 nucleotide sequences of BAC 120 and BAC 77 are in opposite directions with respect 
to the vector backbone pECBACl . 
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BAC 120 was digested with Not I and a 145 kb fragment was re-cloned, but in 
the reversed orientation (flipped), into Not I digested vector backbone pECBACL 
This resulted in a deletion of a region of approximately 1 1.5 kb from the 5' end of the 
insert sequence of BAC 120 and which was upstream of the DNase I sensitivity 
5 region. The reversed BAC 120 'flip 5 and BAC 77 clones were digested with Srf I and 
RARE digested using an oligonucleotide targeted to an EcoRI site within ovalbumin. 
5' and 3' fragments were isolated by CHEF gel electrophoresis, and ligated together 
to yield the complete contiguous ovalbumin genomic locus BAC. 

10 Example 2: Expression of a heterologous gene by a chicken ovalbumin locus 

BAC 

cDNA constructs encoding immunoglobulin light-chain and heavy-chains of a 
human IgGi kappa monoclonal antibody were inserted in-frame with the ovalbumin 
translation start site of separate ovalbumin locus-containing BACs, as shown in Fig 

15 3. The immunoglobulin chain-encoding cDNAs were first inserted into a plasmid that 
contained a 2.7 kb EcoRI fragment from the ovalbumin gene and which included the 
ovalbumin start site. The resulting vector was then digested with restriction 
endonuclease EcoRI and cloned into an approximately 195 kb ovalbumin BAC which 
had been subjected to EcoRI recA-assisted restriction endonuclease (RARE) 

20 digestion as described by Boren et ah , 1996, Prot ScL 5,: 2479-2484 and 
incorporated herein by reference in its entirety. 

Transgenic birds were created by cytoplasmic co-mi croinjection of human 
light-chain and heavy chain BACs (figure b) followed by ovum transfer as described 
in U.S Patent Application Serial No. 10/251,364 incorporated herein by reference in 

25 its entirety. 

A hen carrying these constructs was grown to sexual maturity. Eggs were 
collected and the egg white material was assayed for the expressed human 
monoclonal antibody using sandwich ELISA as described by Harlow et al. 9 
Antibodies: a Laboratory Manual. 1988, Cold Spring Harbor, NY: Cold Spring 
30 Harbor Laboratoiy. Xiii incorporated herein by reference in its entirety. The human 
monoclonal antibody was captured by a goat anti-human kappa chain specific 



-43- 



WO 2004/094640 



PCT/US2003/039244 



monoclonal antibody and quantified with an alkaline phosphatase conjugated goat 
anti-human gamma detection antibody. Hen # AA698 expressed up to 1025 pg of 
human monoclonal antibody per ml of egg white. 

5 Example 3: Expression of a heterologous gene by a chicken ovalbumin locus 

BAC. 

The open reading frame of the firefly luciferase gene was inserted into the 
ovalbumin translation start site of an ovalbumin locus BAC as shown in Fig 3. The 
luciferase gene was inserted into a plasmid that contained a 2.7 kb EcoRl fragment 

10 from the ovalbumin gene and which includes the ovalbumin start site. The resulting 
vector was then digested with EcoRI and cloned into an approximately 195kb 
ovalbumin BAC which had been subjected to EcoRl recA-assisted restriction 
endonuclease (RARE) digestion as described by Boren et aL , 1996, Prot. Set 5,: 
2479-2484 and incorporated herein by reference in its entirety. 

1 5 Primary tubular gland cells isolated from the oviduct of laying quail (Sanders 

and McKnight, Endocrinology 116, 398-405(1985)), were transfected using the 
ovalbumin-luciferase construct or with a negative control CMV-IFN construct. 
Luciferase activity in cell extracts was analyzed two days post transfection (Table 1). 

Tablel 



20 



DNA 


RLU 


CMV-IFN 


60 


Ovalbumin Luciferase 


274 



Example 4: Basic Local Alignment Search Tool (BLAST) Analysis of the 
25 Complete Ovalbumin Promoter Sequence (SEO ID NO: 1) 

The complete approximately 195kb ovalbumin promoter sequence (SEQ ID 
NO: 1) was submitted to the National Center for Biotechnology Information for 
BLAST alignments with database sequences. Further analysis was by using the 
GenScan and MARWIZ software. Percent identities between the ovalbumin gene 
30 region sequence (SEQ ID NO: 1) and corresponding known ovalbumin promoter 
features are listed in Table 2 below. 
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Table 2: 

Nucleotide positions of identifiable elements in the region of the chicken genomic 

within BACs 1 20, 77 and 26 



Nucleotide Jrositions 


Domain Identity 


5963-1 


Q 


9730-9922 


CR1 


10772-11935 


CpG Island 


18914-19088 


CR1-GG 


20106-20921 


CR1-GG 


39975-24820 


R ATPase 


41119-41177 


CR1 -GG 


41586-41700 


CR1-GG 


41701-41800 


MAR element 


42221-42742 


CpG Island 


43505-46990 


S Gene 


50017-51427 


T Gene 


56001-56201 


MAR-hke element 


56501-56901 


TV S A T"» 1 * 1 f j 

MAR-hke element 


64599-71919 


U Serpin Gene 


58401-58701 


MAR-like element 


74883-75634 


CR1-GG 


75420-75634 


CRlb . 


76251-76451 


MAR-like element 


80151-80451 


MAR-like element 


81125-94938 


V Serpin Gene 


81832-82120 


CR1 


85473-85922 


CR1-GG 


88654-88797 


CR1-GG 


90120-90167 


CR1-GG 


96401-96800 


MAR element 


97884-97965 


Y:OV-l element 
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99080-99107 


SDRE element 


100602-107839 


XGene 


110247-111200 


CR1-GG 


114779-121099 


YGene 


117849-118132 


CR1-GG 


131729-139290 


Ovalbumin 


144651-144850 


MAR element 


147721-155849- 


WGene 


150801- 151600 


MAR element 


156581-157181 


MAR-like element 


157081-157581 


MAR-like element 


157132-157331 


MAR-like element 


159095-165114 


MENT 


163701-164100 


MAR element 


171633-180432 


Zl 


183204-190418 


Z2 


186201-186590 


MAR element 


190101-190800 


MAR element 


192078-195101 


Z3 



Nucleotide positions of protein encoding regions are from the beginning 
of the first exon to the end of the poyadenylation signal-exons are shown in 
Fig. 1 

b Protein coding regions are given in bold 

5 

Although preferred embodiments of the invention have been described using 
specific terms, devices, and methods, such description is for illustrative purposes 
only. The words used are words of description rather than of limitation. It is to be 
understood that changes and variations may be made by those of ordinary skill in the 
10 art without departing from the spirit or the scope of the present invention, which is set 
forth in the following claims. In addition, it should be understood that aspects of the 
various embodiments may be interchanged both in whole or in part. 
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What is Claimed Is : 

1. An isolated nucleic acid molecule comprising an avian matrix attachment 
region and an avian ovalbumin transcriptional regulatory region. 

2. The nucleic acid molecule according to Claim 1, further comprising a second 
5 matrix attachment region. 

3. The nucleic acid molecule according to Claim 1, comprising an avian 5' 
matrix attachment region and an avian 3 1 matrix attachment region. 

4. The nucleic acid molecule according to Claim 1, wherein the nucleic acid 
molecule is isolated from a chicken cell. 

10 5. The nucleic acid molecule according to Claim 1, comprising a nucleotide 
sequence having at least about 80% identity to the nucleotide sequence 
according to SEQ ID NO: 1, or the complement thereof. 

6. The nucleic acid molecule according to Claim 1, comprising a nucleotide 
sequence having at least about 95% identity to the nucleotide sequence 

15 according to SEQ ID NO: 1, or the complement thereof. 

7. The nucleic acid molecule according to Claim 1, comprising a nucleotide 
sequence having at least about 99% identity to the nucleotide sequence 
according to SEQ ID NO: 1, or the complement thereof. 

8. The nucleic acid molecule according to Claim 1, comprising the nucleotide 
20 sequence according to SEQ ID NO: 1, or the complement thereof. 

9. The nucleic acid molecule according to Claim 7 3 wherein the nucleic acid 
molecule consists of the nucleotide sequence according to SEQ ID NO: 1, or 
the complement thereof. 
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10. The nucleic acid molecule according to Claim 1, wherein the nucleic acid 
molecule is a truncated variant of SEQ ID NO: 1 comprising at least 103 
kilobases of SEQ ID NO: 1. 

11. The nucleic acid molecule according to Claim 1, wherein the nucleic acid 
5 molecule comprises a truncated variant of SEQ ID NO: 1 having a 5' end 

selected from the group consisting of the nucleotide positions about 41000, 
about 56000, about 58350, about 76200 and about 80000 of SEQ ID NO: 1, 
and having a 3' end selected from the nucleotide positions about 191500, 
about 187000, about 164500, about 157600, about 157100, about 152000 and 
10 to about 145500 of SEQ ID NO: 1. 

12. The nucleic acid molecule according to Claim 1, wherein the nucleic acid 
molecule comprises a truncated variant of SEQ ID NO: 1 and comprises the 
nucleic acid sequence from about nucleotide position 41500 to about position 
195101 of SEQ ID NO: 1. 

15 13. The nucleic acid molecule according to Claim 1, wherein the nucleic acid 
molecule comprises a truncated variant of SEQ ID NO: 1 and comprises the 
nucleic acid sequence from about nucleotide position 41500 to about position 
187000 of SEQ ID NO: 1. 

14. The nucleic acid molecule according to Claim 1, wherein the nucleic acid 
20 molecule comprises a truncated variant of SEQ ID NO: 1 and comprises the 

nucleic acid sequence from about nucleotide position 41500 to about position 
164500 of SEQ ID NO: 1. 

15. The nucleic acid molecule according to Claim 1, wherein the nucleic acid 
molecule comprises a truncated variant of SEQ ID NO: 1 and comprises the 

25 nucleic acid sequence from about nucleotide position 41500 to about position 

152000 of SEQ ID NO: 1. 
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16. The nucleic acid molecule according to Claim 1, wherein the nucleic acid 
molecule comprises a truncated variant of SEQ ID NO: 1 and comprises the 
nucleic acid sequence from about nucleotide position 41500 to about position 
145500 of SEQ ID NO: 1. 

5 17. The nucleic acid molecule according to Claim 1, wherein the nucleic acid 
molecule comprises a truncated variant of SEQ ID NO: 1 and comprises the 
nucleic acid sequence from about nucleotide position 96000 to about position 
195101 of SEQ ID NO: 1. 

18. The nucleic acid molecule according to Claim 1, wherein the nucleic acid 
10 molecule comprises a truncated variant of SEQ ID NO: 1 and comprises the 

nucleic acid sequence from about nucleotide position 96000 to about position 
191500 of SEQ ID NO: 1. 

19. The nucleic acid molecule according to Claim 1, wherein the nucleic acid 
molecule comprises a truncated variant of SEQ ID NO: 1 and comprises the 

1 5 nucleic acid sequence from about nucleotide position 96000 to about position 

187000 of SEQ ID NO: 1. 

20. The nucleic acid molecule according to Claim 1, wherein the nucleic acid 
molecule comprises a truncated variant of SEQ ID NO: 1 and comprises the 
nucleic acid sequence from about nucleotide position 96000 to about position 

20 164500 of SEQ ID NO: 1. 

21. The nucleic acid molecule according to Claim 1, wherein the nucleic acid 
molecule comprises a truncated variant of SEQ ID NO: 1 and comprises the 
nucleic acid sequence from about nucleotide position 96000 to about position 
152000 of SEQ ID NO: 1. 

25 22. The nucleic acid molecule according to Claim 1, wherein the nucleic acid 
molecule comprises a truncated variant of SEQ ID NO: 1 and comprises the 
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nucleic acid sequence from about nucleotide position 96000 to about position 
145500 of SEQIDNO: 1. 

23. A vector having inserted therein a nucleic acid molecule according to Claim 1. 

24. The vector according to Claim 23 selected from the group consisting of an 
5 artificial chromosome, a plasmid vector and a viral vector. 

25. A liposome composition comprising a nucleic acid molecule according to 
Claim 1. 

26. The nucleic acid molecule according to Claim 1, wherein the nucleic acid 
molecule is a recombinant nucleic acid molecule. 

10 27. The recombinant nucleic acid molecule according to Claim 26, wherein the 
ovalbumin transcriptional regulatory region and the matrix attachment region 
are independently capable of hybridizing under high stringency conditions to 
the nucleic acid sequence according to SEQ ID NO: 1, or the complement 
thereof. 

15 28. The recombinant nucleic acid molecule according to Claim 26, further 
comprising a second matrix attachment region independently capable of 
hybridizing under high stringency conditions to the nucleic acid sequence 
according to SEQ ID NO: 1, or the complement thereof. 

29. The recombinant nucleic acid molecule according to Claim 26, further 
20 comprising a heterologous nucleic acid sequence operably linked to the 

ovalbumin transcriptional regulatory region. 

30. The recombinant nucleic acid molecule according to Claim 26, further 
comprising a endogenous nucleic acid sequence operably linked to the 
ovalbumin transcriptional regulatory region. 
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3 1 . The recombinant nucleic acid molecule according to Claim 26, wherein the 
ovalbumin transcriptional regulatory region is capable of tissue-specific 
transcription by an avian oviduct cell. 

32. The recombinant nucleic acid molecule according to Claim 26, further 
5 comprising an Internal Ribosome Entry Site. 

33. The recombinant nucleic acid molecule according to Claim 32, further 
comprising a second heterologous nucleic acid sequence operably linked to 
the Internal Ribosome Entry Site. 

34. A vector having inserted therein a recombinant nucleic acid molecule 
10 according to Claim 26, 

35. The vector according to Claim 34 selected from the group consisting of a 
bacterial artificial chromosome, a yeast artificial chromosome, a plasmid 
vector and a viral vector. 

36. The recombinant nucleic acid molecule according to Claim 26, further 
15 comprising a polyadenylation signal sequence. 

37. The recombinant nucleic acid molecule according to Claim 29, wherein the 
heterologous nucleic acid sequence encodes a polypeptide having a codon 
complement optimized for protein expression in an avian. 

38. The recombinant nucleic acid molecule according to Claim 26 further 
20 comprising an origin of replication selected from the group consisting of a 

bacterial origin of replication and a viral origin of replication. 

39. The recombinant nucleic acid molecule according to Claim 26 which is a 
bacterial artificial chromosome. 
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40. A recombinant nucleic acid molecule comprising: 

(a) an avian ovalbumin transcriptional regulatory region; 

(b) an avian 5* matrix attachment region; 

(c) a heterologous nucleic acid encoding a polypeptide; 
5 (d) a polyadenylation signal sequence; and 

(e) an avian 3' matrix attachment region, 
wherein the avian ovalbumin transcriptional regulatory control region, the 5 1 
avian matrix attachment region, and the avian 3' matrix attachment region 
each hybridize under high stringency conditions to the nucleic acid sequence 
1 0 SEQ ID NO: 1 , or the complement thereof. 

41. The recombinant nucleic acid molecule according to Claim 40, further 
comprising an Internal Ribosome Entry Site. 

42. A method of generating a genetically transformed avian cell, comprising 
delivering a nucleic acid molecule according to Claim 1 to the avian cell 

15 under conditions whereby a genetically transformed avian cell is generated. 

43. The method according to Claim 42, wherein the nucleic acid molecule enters 
the nucleus of the avian cell 

44. The method according to Claim 42, wherein the nucleic acid molecule 
integrates into the nuclear genome of the avian cell. 

20 45. The method according to Claim 42, wherein the nucleic acid molecule is 
integrated into the nuclear genome such that it is subject to a reduced 
chromosomal positioning effect compared to an integrated molecule not 
having a matrix attachment region element. 

46. The method according to Claim 42, wherein the avian cell is selected from a 
25 chicken cell and a quail cell. 
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47. The method according to Claim 42, wherein the avian cell is an oviduct cell. 

48. The method according to Claim 45, wherein the oviduct cell is a tubular gland 
cell. 

49. The method according to Claim 42, wherein the avian cell is a cultured avian 
5 cell. 

50. A method of expressing a heterologous polypeptide in an avian cell, 
comprising the steps of: 

(a) delivering a nucleic acid molecule according to Claim 29 to a 
recipient avian cell under conditions that generate a genetically 

10 transformed avian cell; and 

(b) culturing the genetically transformed avian cell under 
conditions that produce expression of a heterologous 
polypeptide under the control of the avian ovalbumin 
transcription regulatory region. 

15 51. The method according to Claim 50, wherein the nucleic acid molecule is 
integrated into the nuclear genome of the recipient avian cell. 

52. The method according to Claim 50, wherein the recipient avian cell is a 
chicken cell. 

53. The method according to Claim 50, wherein the recipient avian cell is an 
20 oviduct cell. 

54. The method according to Claim 50, wherein the recipient avian cell is an 
oviductal tubular gland cell. 

55. An avian cell, or the progeny thereof, comprising a nucleic acid molecule 
according to Claim 26. 
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56. The avian cell according to Claim 55, wherein the avian cell, or the progeny 
thereof, expresses the heterologous polypeptide encoded by the nucleic acid 
molecule. 

57. The avian cell according to Claim 55, wherein the avian cell is selected from a 
5 chicken cell and a quail cell. 

58. The avian cell according to Claim 55, wherein the avian cell is a chicken cell. 

59. The avian cell according to Claim 55, wherein the avian cell is an oviduct cell. 

60. The avian cell according to Claim 59, wherein the avian cell is an oviductal 
tubular gland cell cell. 

10 61 . The avian cell according to Claim 55, wherein the avian cell is a cultured cell. 

62. A transgenic avian individual comprising a nucleic acid according to Claim 
26. 

63. The transgenic avian individual according to Claim 62, wherein the avian 
individual is selected from the group consisting of a chicken, a turkey, a duck, 

15 a goose, a quail, a pheasant, a ratite, an ornamental bird or a feral bird. 

64. The transgenic avian individual according to Claim 62 which is a chicken. 

65. The transgenic avian individual according to Claim 62 which produces the 
heterologous polypeptide in serum or egg white. 

66. The transgenic avian individual according to Claim 65 which produces the 
20 heterologous polypeptide in egg white. 

67. An avian egg wherein the egg white comprises a heterologous polypeptide 
expressed from a recombinant nucleic acid molecule according to Claim 29. 
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68. The egg white of an avian egg of Claim 67. 

69. A heterologous polypeptide obtained from an egg of Claim 67. 

70. A heterologous polypeptide obtained from an egg of Claim 50. 
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SEQ ID NO: 1 

1 TGCAAGCTTG CAAGTTTTAC TGTCTCATAT GGTTCACATA CCATTCATTT 
51 AGCGTGGCCT CACAACCAAC TTGGGAACCA CTGCCCTAAG TAACAATACC 
101 TCACATATCC ATAGGAT CTT CAACACCATT CTGCTTAAAA TTCAAATCCC 
151 TTTCATTAAA ATAGAAGATA TTTAGATACA AGATAAATAG AGCATATGTT 

2 01 TGTATGTGGT CATCTTCAGT TCTAGAAACA GTTTCTTCTC TCTTATGCAA 
251 CTGAAAGTAG CTTAGTTTCC ACAGCCCCGT GCAGAGCAAT ACGGTTGTAC 

3 01 TGCTCTCAAA GAAGTAC CAA CACAACATGC ATCAGCAGAG TTGCACAAGA 
351 CTACTGAAAT AAAATTTGCT TAGCTGCTAG TGAAAACATA AGTATCCTTG 

4 01 CTTGTAAACA AAAGCATAAG TCATTTCACA CACCTCCATT TGCAGGGCTT 

Q gene exon 5 

4 51 CAGCTAATCC TCGAAGAGCA AACTTGGTCG GAGAATAGGC TGTATAACCA 
501 AAAAGGCCTA ATTGCCCAGC CTGAGATGAT ACAAACACAA TCCTTCCCAT 
551 CCGTCGTTCC TTCATGGTAG AGATGACTGC ACGACTCGGG TAAACACTAC 

6 01 CTAGATAATT GACTGCCATT AATCTCTGTG GGAAAAAAAA AAGATATTAA 
651 AATGGTATTT CAGTCACTGA TTCAGAATTC TTCAGATGTT TAGTTTTATT 

7 01 TCTCCAAAAT CCCTACTGAG GCTTGCTTTA ACATCAAACT CC ACT AC ATT 
751 TCTATCAGCG AGCAAAGTAA ACAATTGTTT TCTACTGTAT TTGTTTCATA 
801 AACGTTCCAT AG C AAAGAAT CTCACCGTAA GACAAAAAGA AATGCACTGT 
851 ATATACAAGT GTTAATGGCA TTGAGGATAC TAATGATGTG AGCTATTATG 
901 GCTTTCACAA AGAAAAAAGG G C AAATTAC A AAAAAAACTG ACTTGAACAG 
951 TGTCTAGGAA GCAATCCATT TATACAAGAA GTACAAAAAC ATTGAAGTGA 

1001 AAGTTTAAAG GGCTTCCTAA AAAAAAGAAA GAGGAAAAAT TCTATGTATA 
1051 TATGCACAAG CTCTCATCTT GGCAGGTAAG ATTAATTCCT GTTTCAATCT 
1101 CCTGAAGACC TAAAACACAA CTGGTGGCCT ACACAAGCAA TAAAACAGAT 
1151 TATTTACCCT TACTGTTACA CTTACTATAA CATTATGCCT ATT CAGTAT C 

12 01 ACGTGCATTT TCAGATGGCT CTGTGATGAA GTGAGAAT C A AAGTAAC CAG 
1251 AAAGTAGAGA GGCTGTTTGG TTTTTTTTAA TGTATTTTGG TTTGGGTTTT 

13 01 ACTGTTGTTC CTTTCTTTCC ACCAAGCTGC ACCAAGTTAC CAACAGTTTA 
1351 CATAC CGC AG AGTTGAATGC TGACAAGATG GATTCTTCTT T AAC AG ATT C 

14 01 AAAATATGGG AAAAAGACCC ACACATCTTC AC CAGGAAAA AAGAGCCCAA 

14 51 TTCAGTGTCT TTCACAGCGT GTTAAATAAA AGC AC AC AT C AACCTTCCTT 

15 01 TCCATCTTGA GGTCCTTCCC TATCCCAGGT GGGGAATTCA CTGTTCTGAA 
1551 TTGTCCATGG AC CAATAAT C AGGTAAAAAA ATAAGTTGCA ATCCTTGCAA 
1601 ATCTAAAAAT GCATACAAAG ACCACCACTA ACGAAATATA TTGCTACTCA 
1651 CACAGTCCGG AAAGACACAG TAAAATAATC TATTCCTTAG AACAG CGGAA 
1701 TGACAAATTA CTTGCAATGA AAT AAAG CAT GATCACTCAC TTCAAAAGAA 

Q gene exon 4 

1751 TTTACTTCAA TATCCTCAAA TTTTCCTGTA ACTGATGTTC CTGCACAGTT 
18 01 GACAAGCAGG TCAACTGGTC CCAGCTTCTC CTGAGCCTGC AAGAGGACAG 
1851 AAATTAAGCT CTCCAGCAGG AAAAAAAGCA TTCATATAGC ATAGTGTCAT 

Q gene exon 3 

1901 CATTAAATAT ACTGCCACTT GAAACACATC GTACAGAGAT GACCTTAATA 
1951 CATACGCTGG TGGAACAGTT GCTTCTCACC TGTTTTAGAA CGTTCTCCAC 
2 001 CTGTTCATAG TCTTTAGATA CAT CAACAGA AATACACAGC ACAACCTAAA 
2 051 AATGGAAAAT GAAAGAACAT ATTATAACCT CAGTCCAAGA ATGGTTGGCC 
2101 CCAAGTCTTA CAAAAGAAAC TGCACACTGC TTCATGAGGT CATATTAGGG 
2151 ACAAAATTAG AAAAAAACAA GACTAAAAAA AACTGATTAG TGGCACAGAG 

22 01 CTAAATATGC CTAAAAAGTG ATTTAACGTT TACTCTTTGT CTGGGAACAA 
2 251 GAGGTTATGG ATCTTAGTTT GAACGATACA GGAAAATGAA AATAGACACT 

23 01 GGCCCAGGGG AGAATAAATA AATAAATAGT AAAAAAAAGA GAG AC AG CAA 
2 3 51 TGTCTTAATT TTTGTTGCAG CTGGAACGTA GGTATTGATT TTCTTTAAAA 

24 01 CTGCTGATTA CATTTAGCAA CTTGATCTCA GAATCTAAGT TTTGAAGCAC 
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24 51 CTTCCAGTCA ACTTCCAGCC AAAGGAAGCA CTGACTTTGG GGTATCACCA 

25 01 ACTGTAAGAT CACTGCCAAC TCCAGCTTGG CCGGGGATTT ACAAAGAGAT 
2551 TATCCCCTCT GCTAAACAAC TAT CAAGGTT CTGAGGCAAC TCAGTATCTT 

26 01 GAAAGGAGAA GCAATCACAT ACCACAATAG AAGTAGAGAC TCCTGTATTC 
26 51 TCATTCTGAT TTCTACGTCT TACTTTGTCC AATTCCCTTA AAGCTGGTTT 
2 701 GAGGTGAAAT AAAGTCATCA CCGGATTCTA ACAAGCGTCA TCAGGGTCAC 
2 751 TTGTCATGAA GCACGCCAAC AAAAATGAAG AAAAGCTTAC CAAAGTGTAA 
2 8 01 CTAACTGCTT TGGCAGCTAG TCATTTGTCA GTTTTGCCTC TGGTCAGACT 

2 8 51 GCAACACTTC CCAGCTACAT AAACAGAATC TGCTTGCCAT TCCTGTGCAG 
2901 CAGGTTGGCT ATTCCTGCCT TCCAGCACTG TGCATAGCTG GACATCGGAG 
2951 TAAACTTGTG GTGCAAGTTA ATGGTTGGCC AGCAAG CATC TCCCCTTAGG 

3 0 01 GCATTTTTAG ATTTTAGAAA TATTTATCTT TTACATAAGT TTGAGAACAA 
3 0 51 AAATAACAGC TGAGCTAAGA TAACACTGAT TTACTATCTG GATGTCTTTA 
3101 CAAGAT CAGC AAGGTTAAAA ATCACAGTTC CATACGAAAA TTGGACATCG 
3151 GACCACAGAT CAGTTAATCA TACTGAGAAC AATATACTCA GGAACTACTA 
3 2 01 AAGCT TGTAT TGGCTACCAG CACATTATAC AATTCATCTT TTTGCTTCTA 
3 251 TTATTGTATT TCCTTTCTGC TACATTAAGT TGATTACCTC CAGTCCAGAG 

33 01 CATGCACTGT GAATGTGGTC CTAATAAACA GACTATGCTG CCAGGAAGTC 
3 3 51 TAATATCCTC ATTCCAGTTT CTCGTCTTTT GTTCTAATAG CGCATTATCT 

34 01 GACCATTCCT AAAGCATTCG TTTCTCAATA AAAGCTCAAC TCCACTCCCA 
3451 GTGAAGTAAC TAGGAATATT CCATACTGAG AAAGTGATTT TAACCATTTT 
3 5 01 CCAAAAAAAT TATGGG CAAG CATTTCATTT CACTGCATCC CTTTATTTGT 
3 551 AAAGGCTGCA CTGT C AG CAT TCTAAATAAA TCACATACAA ATGTATTTCA 
3 6 01 GAGAACACGT GCCAAATCCA ACACCTAAAG GTACGTTGAC AC C AGAATC C 
3 651 TGGTTCTCAA TGCAACTTAG TTTGACATAT ATATACATGA GCAGAATGCA 
3 701 CAGCATATAA TCATGTAAAA ACTGGTATTT CCTCTGGATT TTTGCTCCCT 
3 751 AG CTTGGCAT CTTGCAAATG TAACAGCCTC TGTGGTCTGG CAACCCATCC 
3 8 01 CAGACAACTC TATACTTTCA GCAATCTTCT TGTACAGCCT CTGTCTCTTT 
3 851 CGCTGCCCTC ATTTGTCAGC TTCTTATCTG CCGTTCTTCT TCTATTTGTG 
3 9 01 TTCTTTGAAT TTAGAACAGG CAGACTTCCC TGTGACAATG ACTACAACTC 

3 951 AACAGTAGGT ACACGAGGGC ACTCCCACTA CAGCCAGATG AAGTATGGAC 

4 0 01 AAGTCTAAAT TTGCTACTGC TAAAACAATC CTGTTTAGCT CACACTGAAG 
4 051 CTTGTTCAGT CTTGAACTTA CTGAAATTTT AACTGAAACT CTTGCAGACA 
4101 CAT AC C TAGT ACTCTTCTTA AGAAAACAGA AATGTAAGT C CACATCTGTC 
4151 AAGCTTGGAC AGATTTTTAA CAAGAGCATT GTGACAGTGT TTGACAAAGG 

42 01 AT CAAAGTCA TTGTCAACCC TGGGAAGAGA CTAGAGCAAC TAAAGGGAAA 
4251 CCCTTACTGG GGTTTATTTG TTTTTAAAAG GGGACAAAAC GTTGCTCCTC 

43 01 CTCACCTCAC ATGACTTCAT ATTTCAAGAT AAATTTGATC TGAGTAAGGC 
4 3 51 AACATGCACA GAAAGAAAGA TCACCTTGAA CCATTTGACA GGAATGAGCA 

44 01 AAGACAGAAT CTTGCTGAGT GTTCTTGATA TCTTCATTTA CATCGGCTAG 
44 51 TACTAACATT GCTTCCAAAT ATCCATCTTT CTCCACTGCT TCTTTTCAGG 
4 501 AGAATATCCA ACTTATCCAT TCAC CAGATT ATT AT AC AC C CAATAGACAC 
4 551 CAAATCCCTT AT CAC AAAC A AATTGCTTGC ACTTGTAATA AAGAAAATAA 
4 6 01 GAAC C AACTA CTGGAG CAC A AGAGAGAGGA AATCAAAGAT GCGATGGTCT 
4 6 51 CAATGCTGTT TCCCGTCTCT TTTGGATTAT GAGTATTAAG AAAAAAAAGT 
4 7 01 TGTAGTATTT CATCTTGAAG CTGTGTTTTT TTGAACACAG AAAGCTGCCC 
4 751 TATTTTAAAA CATAGCACCC CAGCAAGCAT TAATGCACTC AAACAGTGCA 
4 8 01 TTAAGCGTGA ATACATTATT TATTTACCAA AGTACCTCCA GTGTTTAGAA 
4 8 51 ACCTCAGTGC TTTGCCAGGA AAACAGCAAT ACACAGCACA ACATAGGGAA. 
4 9 01 TGAAATCTGA AAATAAGTAC CTAAAATTTA TTCCCCATTA CAATTGCTGC 

4 951 ATTTACGATC TCTCGAAGAG AAACTTGAAA AAAGTAGGTT ATTCTCCTCA 

5 0 01 ATCTTGAACA ATACAAACAC TTATCAAGTG ACAGCAAATT G CAGCTATCA 
5 051 AAAAAAACTC TAATACACCT ACAGGTGAAG ATACCAATCT TTTTCTCTGG 
5101 GAAAAAAAGT TAGCATACTA ACCAGGCAAC ATCAAATGTA GCTGAGTTTG 
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5151 GAACATCACA CGTAGACAGA TTGGAATTTT TTGAATTCAA GTGAGGCCTC 

52 01 ACCTATTTTT AATCAAACAG TTTGGTTTTA AAAACCAGAC TACTGAAGCT 
5 251 ATGTTGAAGC GACTACCTTG AAACACAACA GT TAT AG GT A GCAGCGATCT 

Q gene exon 2 

53 01 TACCTGCTTG TCATTAACAG AGTACTTTTC TATTTCCTTC TTTGTCTGTA 
,5351 ACAGCTTGTT CTGAAACGGA TAATAAGAAC AG CTAAAGC A CCGCAGTTAT 

5401 GAGAACACAT TTGTCTTCAG TATTT CGAGA AGCCCTAGAA ACACTCTCTA 

5451 AAAACAAGAA CATTATC AT C AGCTAGCGAT AACGCTTTAC TATGTT CATC 

55 01 TTAGAGGTTT ATGAACGTAC AAAACATCAA CTCCATGCCA TAGCAATGGA 
5 551 AGAGTTTTAA GGCAAGCTCA TCACAAACTG CATGTTAGAT TACACTACTT 

56 01 ACAAAGGCAG CCATCTCTAC TCTGCAAATA CATCACATTT AAATTATTAG 
5 651 GATCACCTTA AAGCATACAG TGGAAAAAAA AC CAATGAGT AATTAT CAAT 
5701 TCCCAAATTA CTCCTCAACC ATAC CAATTT CATTAATATA AGGTAGC CAG 

Q gene exon 1 

5 751 CAAAAATACA AATACGCTTA CCTCATCTCT TGCAATCAGT GTTATGAAAG 

58 01 CTCCTTGCTT ATAACATTCA ATAGCAATAC ATTTTCCAAT TCCACTGGAG 

5 851 CCTCCAGTAA CCTGGAAGTT ATTTGAAAAG TCAGTTAGTC CGTTATTCTC 

59 01 ATCCCTACCC CTGTATCATA ACAAGTCAAA TCAAAACTCT AAAAGTGTAC 
5951 TGAGATTCAA ATATCACCCA GTGAAGTATG AACACATGAG ATATACAAAA 

6 0 01 GTTTCTTTAC TTACTGAGTA CGAATGGAAA AATGAAATCT AACTACCTTG 
6 051 TCAGGCAACT CTCTCTGTCC CAAAAACCAG CTTTACCTTA ATGGTATTTT 
6101 TTCCATATTT CTTATGTCAG GCTCACTTAG ACCCACTCTC ACGTGC CTTT 
6151 GATTAACTTA TTTTTGAAGT ACGGTAATAG TTATAAGTTT GCAAT C ACTA 

62 01 TTAACATTAA GAACGCAATT ACATCCTGTC CCTCCATGCC AGGCTACAAT 
6251 AAAATG CAT A AAAAGAGCAA TCATAGCTTG TCTGCCAAAA ACACCGTGTA 

63 01 AACAAACACA ATTTAAAAAG TTATCTTCTT TCTTATTAAA TAAACATCCT 

63 51 TAATTTGACA GCTGGCCCCC TGCAGAGCAC GTCTCACAGT TCACATCTGG 

64 01 ATGCTGAACT GCCTCAATGC CCTTCTGTGG GGATCCCAGT TCTAAGCCTC 
64 51 CATAAGCCTT TCCTTCTTCC T CT AAAC CAT AACTCTCAGT GGTCTCCCAT 
6501 GCAAACGGGA AATAAACAGT TTGC CTGCAG ACAAATGCAA GGCGATGGCT 
6551 ACAACCCCCC ACTAATAAAC AGGT TAAAGT CACCTGAACA CCCCCCTCTC 
6 601 TACCCTCCAG AAAAAT CTAT TTGTGAACTC CATTTAAAAC AAAACATAGA 
6651 TGGTGACCCA ACTGCTCTTC TTCTTATCGC AAATAAAACT GT CAT C CTGA 
6 7 01 TACTGACTTT TGGTGATTTG TTCATTTAAA CCCAGCCTCA GTTTGTACGT 
6 751 GCCTTCAGTC C TTAAACAC A CAGAGTTGTA TTTTCCTTGT ATGGCCAAAT 
68 01 GTGTTGAATT TTCTCCTAAG AGCGTATCAC TGAGAAAAGA TTGGAATGTT 

6 851 TCATTGATGA AGGGAGTTCT GAAATACAGG CTTAAATATT TGAGAAAGAG 
6901 ATTCATATTC CTCTGGAAGT TGTCATTTCT GTTCTTTATC AAGACTTCAG 
6951 AGAAAGTAAG GTATCACCTC TATTTCAACT ATGAGAAATG GAATGCACTG 
70 01 AG C CAAT CAG TGGAAGAGCA AAGGCTTCCA GTCTCATCTG CACGTGCCTT 

7 051 TGTTTTGTTT TCAGGCTCAC TACCCTCAAA ATTCTTTCAT ATTCCTGGGG 
7101 GCAGTAAAGT CAAAAGAGTT AGAAGTACGC TCCCATGAGT CACCAAGCTC 
7151 TTCTTTCTTA CGAGTGTTTC ATCACAAACC TTCAAAGAAA ACAGCTCAGT 

72 01 ACACAAAGGG TGATCATGTA ACCTCAGCAA ACTTATTTTC AACAGATTCC 
7251 ACAGTTTTTG TCTAATTTGT CTGCTGTCAC CAGTGGGTCA AGAACTTAGA 
7301 AAGACTAACT T C AACTTAAA C AGAGTTT C A TAGACTTCCC CTATTAGAGA 

73 51 AAGTCGCTTC AAAGCAGCTT CTAATCGACT CTTTACTACG CTAACCTTTG 

74 01 TGAATGACTT T AAGATG CAC GTTCTGTTCC GGAGATGTTT AATGCTGTTG 
7451 CATTCTGAAT AAAGGATGGC TGCTTCCTAA GCAAATTAAG TCTGTAAGAA 
7501 CGAGTGTTAA CGTTTGGGAA CCCTAGCATC ATTTAATTCT T C AC AACAGA 
7 551 AAGAAGTTAA ATTTTTAGTT CATTCCCAAA GCACTTTTGA AG ATT C CAAT 
7 601 ACATCTCTAG AAGGATTAGG AGATTCAGTT TGATGTATTC AGGCAAACCT 
7 651 CAGTTCCAGA GCACTCATGA GCTGCCTTTA CAGATTTTCA GCAGCAGCTG 
7 701 GT AC AT CAGC ACAGGAACTA ATGAATGCAT GTGGAAGACT GCTCCAATTC 
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7751 CCATTTTTGG CACAGCACTG 

78 01 GCTAAGTGTC ATGAACAGGT 

7851 GCATAATGCC TATGGCTTGT 

7901 CAAGGTCTGA AGTCAAGCAA 

7951 CAAGAACGCT AGATAGGGCT 

8 0 01 AATCAGTTAG CTAGGAACTT 

8 051 CGTTGGAGGT TGTTACTAAC 

8101 GAAACAGAAC AACTCATCGA 

8151 CAACTATCAT CAGAAGCGAT 

82 01 CAGGGCTTGC TAGCTTTCAC 
8251 CTTGCTTTTA GACAGTACTC 

83 01 GAACATCCCC GTAACTTGCT 

83 51 AAATGAC C T C TTCCTGAACC 

84 01 TGCTCCCAAT GCAAGAGCTA 

84 51 CAGGAACACC CAAAGCCTGC 

85 01 TGCTGAAACT ACTCCTAGTC 
8551 CATACTGAAG CTTAACTATG 

86 01 GGAAGACAAT CACAGGCAGC 
8 651 CTATGAAAGC AAAAATACAT 
8701 AAAACTGATG CTTGAGT CAA 

87 51 CAGTGCCCAC TGAGATTACC 
8 8 01 CACCCCCACA AG ATT AGAT C 
8851 CTCTCTAAGC AGAAGAGACA 
89 01 CCTCATGTCA CACAGCAAGT 

8 951 CCCCAACAAA AAAACAACCA 

9 0 01 TACCTCAAAC ACCTAGCTCT 
9 0 51 GAAATAGATT CTACAAATGA 
9101 AATTGAGAAT TCAACAAGTA 
9151 CCAAACACAT TCAATGACCC 
92 01 CACATATGAA GGGGGAATTA 

92 51 TTTTTCTGTG AAAACACTTC 

93 01 AATATTTTCA TGACACCTTC 
9351 CCCATTATAC GATGCAAGAA 

94 01 TCACAAGTAC AGTTTCTAAT 
9451 AC AC AATT C A CCCAAACATT 

95 01 CAGGGTTATT AGCAGCACCT 
9551 ACTGCTCCAT GTTT CATGTG 

96 01 AGCCATAGTC AAACAAGTGC 
9651 TCACCCACAG ATTTGTACCC 

97 01 TCTATTGGGA TATGAACACA 

97 51 GAACAGGTTG CCCAAGGAAG 

98 01 AGGCCAGGCT GGATGTGGCT 
9851 CTGCACACAG CAGGGGGGTT 

99 01 ACCCAGGCCA TTCTATAATC 
9951 ATGTCAAATT CAGGTTACGT 

10 0 01 GGGAAGATTT CTCCAACAAG 

10 051 ACTGAAGTGG CTCTTTGAAA 

10101 AAGCGTAACG ACACAAAGAA 

10151 ACAAACAGGT GGACAATACT 

102 01 CTTACTATTC TGTTTCCACG 

102 51 CCTTTGCTTT TACACAGTAC 

103 01 CTGTGGGAAG AACCTCAGAG 
103 51 AGGACATGAT GCTGTAAAGC 



AGGCACAGCT GATTGCTACC TGTACAAATT 
CAACTACAGC AGTGTTTGAA GGTGAAGGTG 
TAGACAACGT TAACGTTCAC TTCGGAAAGA 
GAAAGGGAAG TTTTTATTTG CAGTCATACC 
C ATGT C AG AT TGGT CAGTGA CAAAAAGCAG 
CAGCATTTAG GTGGAAGGCT GTTACTAATA 
ATGCACCACG AGATGCAATG TAACAGTAAA 
TCAATGAGTA AGCACAATTC C AAAT CAT C A 
CAAAATGAAG ACTTGATATG CTCTTTCCAA 
TGAGCAGCAG CACAGGATGC AAGCTGGCTT 
TAACAGTAGG CATTCAGTTC CACTGAAACA 
CCAATTGGCT GGTGTGCCTG ACTGGACTCC 
ACC CATGTG C ATTTAAGC C A CAGGTCACGC 
CAAAGCTGAG ACACGGAGAT GGGTCAACTC 
TGTCCTTATC TGATGGGCTC TGCAGGGAAA 
CTGAGATGGC CCTTACACCT GTGGCTTGGG 
TCCAAGTTAA AGGCCCTGCT CAAACTGAAA 
AAATATCCTT ACATTCCTGG CATAGTTAGC 
TTAAAAGCAT TTTGTTTTAC TAAATAAGCA 
CCTGATAGCT GTGAAGTCCT TCCTATTTAA 
AAAACACACA GCAGATATGG AACAGGAAAA 
AGTAATTTGG AGTTCTGTGG AGCTCCATTC 
CCTGTGGTCT CGGTCCCACT CTAAAGCATA 
GACTCCAGGA GAGGACAGGA GGGAAAAAAA 
CACATTTTCT GCCTCAGATT TCTGTCCTTG 
ATTAACCATT ACAACTGTTA GGAGCAGAGA 
TTAAGTAGGG TGTGAAAAAA TGCTGATTTA 
ACTGCTAGGA AACAATATAC ACAGTAGCGT 
TCTTTAAGAA ATACTACAGG TATTCTCCTG 
TACATACAGA TCACACAAAC AGATTTTGAC 
AAGGAAGTTC TTAACAGTGC TACGGTCCTG 
TGATAATTTC AGTAGCAGAG AACACTCACT 
TTCACCTTGC ACATGTGATG TGTACAGCCT 
TTACAAAACT AT CAT C CAAT GTTTTTCCTT 
TGAAACCCAA ACATGCAAGT CTAACAGCAT 
ACATAT CAAA AAAACTAAAA CTCTGAAACA 
TTTGCACAAA TGCTTTCAGA GCATAACTGA 
TTCTGCCAGA AGCAATGC C A CTGAAGCTGA 
TGATGCCAAT AAATAC CTAC AGCCTTTCCT 

CR1-GG 

GTTAATCCTC CAGAAGGTGG TGACGCACTG 
TTGTAGATGC CCCATCGCTG GAGGCATTCA 
CTGGGCAGAC TGGCCTACTG GTTGGCGACC 
GAAACTAGAT GCTCATTGTG GTCCATTTCA 

CTACGAATAG TCATTTTGAG ACCATCACTT 
GGCTAATACA ATTAGCAGTA GTGGCTGTGA 
ATTATTCTTT GTCATTTTCA TTGTGAGCCA 
AAAGAAGAAC CAGCAGAGTA GCTTTGGAAA 
AAGACAACAC TCGGGATAAT CAGATTAAAA 
C TGGG AT AG A ATACACTGAA CATTTTGTTG 
CAAGCACTGC AGTACCCTTA CCCTGCTTCA 
AGAAGGATTC CTGCTTAGCA AGAGTTACCG 
AGCCTTCACT CACGCTCTAC TAT CT C C AG C 
CAGTTACAAT AC C C AG CAAT AC CTATTGCA 
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10401 TCAAGTAATT TGGGAACACT GTTGCAACTT GGACAGCTCC AAGCCGGGAC 
10451 AGCTCTATCC GCAAAGAGCA GCCCTAAAAC AAATAGGCAG ATAAAAATGA 
10501 ACACAAACAA ACAAGGCACC ACACAGAGCT CAAAAAAATC CCAAATGCCA 
10551 AGCAGCTGTC AATTCCCTCG CACCTCAGAG GTCTAACTTC TGCATTACAC 
10 6 01 CCAAGTCCTG TAGTAACCAA GTCCTGTAGG CAGC CTG CAT GCCCTACTCA 
10 651 CCCCCCAAAA GCAGATTCAG CAAGCAAACA GCACAGCTCC TCTACACGGA 
10701 GCACACCACG GGTAACTACA GCTAAGTCCC CAGAGCTGAC TGAAGGACCA 

CpG island 

10 751 CAGCCGCCCC GACCCGCTCT C ACGCACCCA CATGCTCCAC ACACACCTGC 
10 8 01 AGGCCCCTGG CTGCTGCTCC CCATCACGCA CCCGCCCAGC TTTCAGCGGT 



10851 


ACGCTCCTGG 


GGAACCGTTC 


AAAAGCTATA 


TTTTCCCGAA 


TAAACCCTCC 


10901 


CAAAGGCTCG 


CTCCTACACA 


GCTGATTACA 


GACAAGCCAA 


ACGTCGCTCG 


10951 


TGGACACGGA 


TACCCGCGCT 


GAGTGCCGCC 


TGACCGCTTC 


CCTTCGCGTC 


11001 


AGCCCGCCCG 


TTTCCTCAGC 


ACGGGTCGCC 


TTTCAGCCCG 


TGCCTTCCAC 


11051 


CCTCGTGAGG 


AGGGCCGCAC 


TCCAGCACCT 


CGGCAGATGC 


AGCGGGGCCT 


11101 


TCCCCGGGAC 


ACGGCGGCCG 


CGGCCTCCCC 


GCTCCCTCCT 


CCCGCCCAGC 


11151 


GCCGGCAGCG 


GACCGCTCCC 


CCGCGGTCGG 


T CAGC CAGC A 


GCGGCCGGGA 


11201 


TCGGGGTGGG 


GAGGGGGAGG 


CGAGAGGCCT 


CGTTCGACTC 


AC CACGACGT 


11251 


GTGCGCCGGG 


CAGCTTGAGG 


GGCTTGGGGC 


TGATGAG CGG 


CGACAC CATG 


11301 


TAGAGGAGAA 


GGACGATGGC 


CACGATGAAG 


GCGGCCGCCA 


GGAGCAGCAT 


11351 


GGCTCGGGCC 


GGGCTTGCCG 


CTGGGGAGGG 


GGCGGCGGGC 


GGTGACAAGG 


11401 


CCCCTGGGGC 


TGCGGGAGGC 


GCCGAGCGCG 


GCGCGGCCCG 


GCGCGGCACG 


11451 


GACAGCGGGA 


GTAGAACCGG 


TGCCCGCCTG 


CGCCGCGGCG 


CCACGGGGCC 


11501 


ACAGGGGAGG 


GGGAGGAGGA 


AGAGGAGGAG 


GGAGGAGGAG 


AACACGGCCG 


11551 


CCACTCCGCG 


CCCTGATTGG 


CTGGTGGGCG 


GGGCGGGCGC 


GGCCTCGCGA 


11601 


GCGGGGATTG 


GCCAGTGAGC 


GACGGGAAGG 


AGCTGGCGGA 


TTGGCCGAGA 


11651 


GGCGGGACGG 


CGCTCGGAGA 


TTGGAGCCGC 


AGGCTGTTTG 


CAGGGTCACC 


11701 


GTTGGAGGCA 


AAGGGCGGCG 


GAGAAGAGAG 


AGTTCCTCCG 


GAGAAGACGG 


11751 


GGTGCGGGAC 


GGCGCCCCCT 


ACGGTCCCCG 


CCTAGGGCGG 


GTGAGGCGAG 


11801 


GAGTGAGGTA 


GGCCCCGCCC 


CTTCTCGTCA 


CGTGGGGCTT 


CCCGCCGAAA 


11851 


GGAGGGGGCG 


TGGCTCCGTG 


AGGTGAGAGA 


CGGGAGGCTG 


CGGGCGGCGT 



119 01 TCACGCTCTG GAACGCGAGG GCAGCTGTTT GTGGGGA AAA AAATGTGTAG 
11951 AAGCGTGGTT TCAAAGCATA ATAATAAAAG ATTAACTAAA ACGAAAACGT 
12 0 01 CTTGCAGCTC AAATAAAATG ATTCCGTGCC TACGTTCAAT ATTTCCTTCG 
12 051 CTGTTTATGA CTGAAAGGAA CTCCCTGAAA TG AT TT AT GT TGTAAACGCT 
12101 GTGCAGCCTC TGACTTGTAG AAGGGAGTTT GCAGCGTACG CGGCTTTACG 
12151 GCCTCGGAGA GATTACGATT ACGGGCGAGA GGGCGTGCGG AAGGGTGTGA 

122 01 AGATAATGCA GGAGATGAGA TTGGAGCGGG GAGTAGGCGG AAATGGG AG C 
12251 AGGGCTGCGG G CAGGGATT A GGCAGTCGTC AAGGGGAGAG CAAAGAATCT 

123 01 GGAACAAAAG AATC CAGGAA TTAGTTCTGG AATGGGATTG AGCCGGGATC 

123 51 GGGCTTCGGT GACGCTTTGA GAGTGGTGCT GTGGGGTGGA GGTGTGAGGA 

124 01 AATGAGAGGA AGAAAG AG CG CGTGCTGAGG TAACAGCTGC CACGGCAAGG 
12451 GTGGGGAGAG AG CTGACAAA GTGGTGTGTC CAAGGAAAGG CGGTGTGGAA 
12501 TCGTAGGCAT CCTTAAGGCT GGAAAGGGT C ACCAAGTCCA AG C AC CAAGT 
12551 CCAACTAGCA GAAGTTGGTG TAGGATATGG ACTAGGAACG CTGCAAGCAC 
126 01 AGATACCGAC TTCATTCTTT GCATACAGGG CAGTGTATGT GTTATCTTTT 
12 651 GTAGAATATT AATTAAACAC AAAGGAGGAG ATTGATAATG TAATAGAGCC 
12 7 01 TATTTATAGT TATCTAGTGC AGAATATGGC GAGACTTGAA AAGCCCAAAT 
12 7 51 GTCAGCAGCA TGGAGATAAA G CAG ACGGAG ATAAATCCAT CTTTCACAAT 
128 01 GCGATATCGC TTTCAGAATC AACATGAGGC AAGGGCATGG ATAAAAACAA 
12 851 TCATCTGCAG TTAATTTCTA GTAAAATGAA GGTTAAACAT GTTGGTAGGG 

12 9 01 GGCCTCTAAA AACCTCAAAT GCATGATATG CTCCTGATTG GTCACAGTTA 
12951 GGAT CACATA TTACTAAATA TTTGAGAAGC CCTTGTAGAT TAACGAGGAA 

13 0 01 TCCCCTCGGT GAATTTTATG CAGAAATCCA TACTGTCTTT TCCTTTTAGC 
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13 051 TAAGTGGCCA CTTTACAACC GTGTGATTGA CAATCCAGGT AGCGTCCACT 
13101 CACATTTTGT TCCTGGGGCA GTGAAGTGTC ATGAATTTAT CTCCAAGAAA 
13151 AACATTCAAA AGTGAAGACC TTGTGAACTG CTTATAACTC AC CAATGTAT 

132 01 CGCCACAGCA GTAGGTTTTT GACTCTTTTT AGGTATGCCA GCAGGCACTG 
13251 AAGTTTGCCC TCCTGAGCTG TCTGCTGTCT GGTTTGTATT TGTCTCATGT 
13 3 01 GACCTCATTC ACTGAGGAAG TGCGTTCCTG ACACACGGGA ATGGTTTGCT 

133 51 ACGAAACTCT TTTCTCAGTG ACTGTGGAAC TGGAAATTGA ACCCTAAAAA 
13 401 AAAAAAGTGT TGAAGCCCTC CAGTCCAAAC TTTGGTTGTA CATAAAGCAG 

134 51 TATTTAATTA ATCTGACCTT GATTAACAAC AT CAAAAAGT GTAATTTTGA 
13 501 AGCACAAACT GAC CAAGGTA TGTATGTACC TTCGGGATGG GTAAGAAAAT 
13 551 AAAAAGGTTA ACACATGCTA ATTGCTTTGC TAATTAATCC TTAGAAGCAG 
13 6 01 CTTCAACACA AC AG CGATGT GTTTAGAGAA GAAAATCAAA TACAGGTAGA 
136 51 TTAAAGCGTC CAAACTATAG GACCAGCTGT GGTTTTCTGC TTCCTCAGTT 
13 701 CTGTTCATAT AATCTTTCAA CAGACGTTTG CAGTAACAAT GTTGTGGGTT 
13 751 GAG AT AAAT C AGTATGAACA AAGCATGGCA AC CGAAGTAA GAAAGTAGTC 
13 8 01 AT T TAAAC AC GGAAACAAAT GTATGAATTG ATAATATTAC AACACAAGTG 
13 851 ACTGATACTA GAGGTGTCCT TTTGATCTTC TTGTTCCCAA AGCATACAAG 

13 901 GTACACACAG AAGAGACACA GGCTGTGTTA AGATGCCATT AAGAGAAGGC 
13951 ATAAAGGTTT G ACAGAG C AG GTAGTGAGGT TGCAGCCTGG ACAGACTTTC 

14 0 01 TTATTGCACT TGAGTACTCA TCTGCTGGAT TTTCTGGTTG TGTCATATTC 
14 051 ACGTTAGGGA GAG AG G AGGG AAAAAG AG C A GGATGCGTAG GCTACTCAGT 
14101 GATTAAACAA AAAAAAAAAG CTGGAAACTT CTTCATGTGA TTTCCATCCA 
14151 GTCAGTCCTT CTGCTTTTAG AGAAAG CAGC ATGAAGGAAA AACTTCAGTA 
142 01 GCCAAGGAGA ACAACTTTTT CCTTCTGTTT TCCTGAATTA ACTTACTTTC 

142 51 CTCTCCAACC TTCTCCCTTT TGTGTAGCAA GCATAGGTGT TCTATGCTCA 

143 01 TTTCTTAAGA GGTCTGTTGC AGTAATCATC AT AAGACAT C AAAGGCATGT 

143 51 TGGCAGTTCT TGGATTCCTG CAAAGCTTCA AGATTTAGAA TGATGGCAGT 

144 01 CTAGGTGAGT TGTTCCTGGT CAACAAGCTG TCTTGATCCC GTGTCCCAAA 
144 51 TGAGAAGAGC TAATAGGGAC ATAAGAACTG AAATCAGAAA AGGATTTACA 
14 5 01 TAACATGCTG GCAGTAGAGG AGAATTGGGC AAGAAATAAT GATCTGCACA 
14 5 51 TGGTAGTGAC TAAAGCAGTG TGACTGAAAT ACTTATCACA CCCAGCTGCT 

146 01 TGCCTTGCTG TTCTTCCCCA AACAAACAAG CAAATCCCTT GTAGCTGAAC 
14 6 51 AATAGCTTCT TTACTGGTCC ATCACGCTGG AG AGAT CAT C AGCTACCCCA 

147 01 TGCAT AG C AG GGTGAAACAG CTCCCAGAGC ACTGTGCAGG TCAAAGTACT 
14 751 ATATGTACCC TGTCTGCTGG AGTGCTATCA CGGTGATCTT CTGGGTATTC 
14 8 01 CTAGAAGGAG ATTTCCTGTA CTCCCAAGCT CAACGTATCA TCCAGAAAGT 
14 8 51 GCTCGCCTGC AG CAGGGACG GGTTCTGGCG ATCTCTGCAG CTTCCAGCTA 
149 01 TGCCGCATGC CCTTATCGCA ATGAACTCAG GCTGGGCTGA TGGCCCAGGT 

14 951 GCTGGAGGCT GCCAGCACGC AGGCAGGAGG TGGTTAT AG C AGCTCAGGCT 

15 0 01 CAGGTCAAAC CAAGGCTTCT TGCTGGGGCA GAGGGGACTG ACTCTGTGGT 
15 051 GCAAAAG CAG GTAGTATATA TATATGTATA TATATACAAA GCCCAGCTAC 
15101 CAGCTGAGAG TCCCAAGGCT GCTGCAGTAG TTTTGCAATG AG C AC AC AGG 
15151 AAACAAGAAG ATCGCTGAGA ACACTGCTGA AATCAGATTT CTGTCTTCAC 

152 01 ACAGGT C AAG CTGATTTAAC TGTTTAATGT AATTGCTGCA GTTGCTTGGA 
15251 AAAAAAAAGA AATAGTAAAA CCATGTCCAA AATGAAC CAT TCATAACTGG 

153 01 TGGCCCATTA TGTGTCACAG CCGATGTTGT GCTGAATAAA TAACTGTACA 

153 51 GGTATTTTAT ATATTGAGCA ACATATTTAT TGAAACAAAA ATAATTTACC 

154 01 TCAAACCAGC GGTAAAAGGA AGTCTTTACT GTCTAATTTA AAT AGG CAT A 
15451 AGTTAAACTC GGGACTGAGA TGATCTTGAA TTTCATTTGG TGCCCATGGT 
15501 TCTTTTTATG TGGTACACCT GCTTACACTT ACCATCACAC TGGAGCAGTT 
15551 TGCTTTTGCC ACCCGAATGT CAGACACTGC TATAGATTTA CAGTAGCTTG 
15601 GGGGGGCTGC AGGTTGGAAG AGGGGGTTGA GGCCTCATCA AGTGCCATGG 
15651 CAAAACACCC TCAAGTAAGC ACGGCTGGAA GCAGGAAGGA TGAGGGAATG 
157 01 AGCTGCCATT TCCTTTGCGC TGGAAGGATC ACTGCTAAAA CTTGTAAATA 
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15751 
15801 
15851 
15901 
15951 
16001 
16051 
16101 
16151 
16201 
16251 
16301 
16351 
16401 
16451 
16501 
16551 
16601 
16651 
16701 
16751 
16801 
16851 
16901 
16951 
17001 
17051 
17101 
17151 
17201 
17251 
17301 
17351 
17401 
17451 
17501 
17551 
17601 
17651 
17701 
17751 
17801 
17851 
17901 
17951 
18001 
18051 
18101 
18151 
18201 
18251 
18301 
18351 
18401 



TCTGTTAGAA 
CCCATTCTGA 
CCGTGGACGT 
CCTCTTTGAT 
CCCTTTGCCT 
CACGAGACCT 
TTTGTACAAA 
CCTCCTCAGG 
GAATTCTAGT 
ATAGACGTAC 
AGGTCTTATC 
TTTAAGACTT 
AACCTTCCAC 
AATCGGATCC 
GGCAGCTGCC 
GTGCACAGGG 
GGGAATCACT 
GCAGTCCTTC 
CAGGAATATT 
CCTTCCAGCA 
AGCAATTTGG 
TTTAGGAAAT 
AGTTCACAAG 
CACGCCATCC 
TCACAAAAGA 
CTTTGAACTT 
TGAAGCTACT 
AGTTAT C CAT 
CGTACTTTCC 
CTGAAATTAC 
AACTTTCTTT 
AGGCAACACA 
CTTGCTTGGA 
TCCATTATCT 
ATTTAGATGA 
GAGTACAGTA 
TCTCCCCACT 
AGT CTGTTTC 
ACTGCAAAAA 
GAAGGGAAAA 
GCTTGCTTTG 
ATTTTAAAAG 
AAGGACAGAA 
TAGTCGGGCA 
AGGGAGGCAC 
CAAAACCTTT 
GCAATGAAGC 
AGTAAAGCCT 
AAACAAATAA 
CCAGCAATCA 
GTCAGCGTCT 
CAGACCCTCA 
TGTGGAAGCT 
CTGGAACAAG 



ACAAACAGGG 
ATTTTTATCT 
TCGTGTTACA 
GAAGTAAAGC 
GGCTGCCTGT 
GCTCCAGGGC 
ACATGGCGCT 
CACGATGAGT 
TAAGGGCGTT 
TCCAGTCCTT 
CTGAAATTGG 
TGCTGCACTT 
CTCCCAGAAC 
CACAGGCCCT 
CTCTTTCCCA 
CAAAGATCTG 
GCACTGCTGC 
ACCTATGAAT 
TTCACTTTTT 
ACTTACAGCC 
TTGCAGTTAT 
GTACACCTCG 
TTTGATCTGC 
AGT CAAGCCA 
TCAAGTTAAT 
GCTTTTTGTA 
GCAGTGCTCT 
TTCTAGGCCT 
CATTGCCCTT 
TTCAAGTTCG 
GC AC AACCTT 
CTTGTCATAC 
TGCAATTACA 
TTGGCATTTA 
TCTCTTTTTC 
TAGCCTGCTG 
GCGTCACTAC 
CATCCCCTAA 
GGCCAGCTTG 
AAATCAGAGA 
GTAGAAAAGT 
GAATC AG CTT 
TAAATCAGAC 
GAAAGGAGGA 
AGGTTGCCAA 
TTCCTCCTCT 
GAGCTTTTAG 
GCAGAACTTG 
ATAAATAGTA 
CTTCCCCACC 
CCCAACCCGT 
GATTGCCCAG 
CTGAGAATTT 
CGGGCACCTC 
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ACGTTCACTT 
CCTGCAAAGT 
AGATAGC CTT 
TACAGAGT C A 
CCAACCACGT 
CAAGGGAATT 
TATGAGTTTG 
ATATTTTTTC 
TTGATTCCTA 
ATTCCAGAAG 
GATGCAGAGC 
ACTCTGGCTT 
TGCAGCCCAG 
ATCTCACACC 
AGGCACTCTA 
CCGTTCCGAG 
AGCACTATTG 
ATCACTACTA 
AGCCAGATAG 
AATTACTGTC 
TGCTTCTCCA 
AGGTAATCTT 
TTTGGGATTA 
TTTCTATTAT 
GATTTGCAGT 
AGCGATATTC 
GGAGATTTTC 
GCTTGGCCAT 
GTACCTGCAC 
TGCCCTGTCC 
TTTACAAGAG 
ACAGCGCTGG 
CAGCCATGTG 
CTGCAAAGAA 
TTAGCTTACT 
AATGCAAGGA 
CAAATATGTC 
TGGCACGAAC 
TACAGATTTG 
AACTGCCAGC 
TGAAGAAATA 
CTT AT CTT CC 
GGAAAAAGAA 
GGTGGAAAAT 
TTAACACTTC 
AAG AAG CAT C 
ACTAAGACTC 
TTTTTTTCAT 
GCGCAGAGCA 
GTGCTCTCCT 
GCACTTCTTC 
GAAGAACAGA 
ATCTGCCTGC 
TCTGTGCTAC 



TGTCCTGTGA 
TGTATTTAAG 
TGAT ACT AT C 
C AAAG C ATG C 
TGCACCACTA 
GAGCACTTAA 
AAAACGTAGA 
TCCACTACTT 
AAGAATTTTT 
GCTTACTCCT 
C ATT CTGAAA 
CCCACATACC 
CCTATCCTCC 
TCCCGGTTCC 
TGGAGCAG CA 
AGAGCAGAGA 
TATTCTGCCT 
CCTTACTGAA 
GAAG CGGATT 
TCTCCTCCTG 
GAGCGGG CAG 
TGAAGAGTGA 
AGCTACCTGC 
GTGCGTATGG 
CTTCGGCAAG 
TCGGGTACTT 
TTTGTGCTCC 
CCCCATAGCA 
TTGTAAAAAC 
TCTTTCAAAG 
TTAAATCCGT 
CAAGGGACTG 
CCCTTGTTTT 
GCTGCTGTTA 
TCTCCTCTAA 
AACCTGCACC 
AGAGTTGCTT 
CGTTGCCCTC 
CCCTGTAGGT 
TTTTGTTCTG 
GGAACATGCT 
CTTTGGGAAA 
AGAAATTGAC 
ACCCAAAAGC 
GATCAAAGGA 
ACCCCTTCCC 
AAGAGAATAA 
AGCTGACACC 
TCAGCACCGT 
ATAGGAGAGC 
ACGGACAGAT 
ACTGCAATGC 
TGGACAGAGC 
CAGTGCTGGG 



TGCAAGAGCA 
CTGATGTTTA 
AATAACAAGT 
ACTTGTCTGA 
CACCCAGCCC 
GGGAAAGTGC 
T C C AC C AAAA 
ACAGCGCTGT 
CCTTCTAATC 
TGTATTTTGA 
ATGACAGTAT 
TTCCTCTTGC 
TCTGCCAGAA 
CCATCCTCAT 
GAACTGCTGA 
AGCATCGCTC 
TT ATT CAG AG 
TATATATTTT 
TTGTAATTAC 
ATTCCTGTCC 
AATTTTTTGC 
CAGGTTCTAA 
TAAACTACCA 
CTGATTCTTA 
CCTCTGGTTT 
TTTGTGCTTG 
TGGCTGTCAG 
CGGGGAGAAC 
GCTAGAGGAA 
CCATTCTGAG 
TTCTAGTTCC 
CTGTTTATTT 
CAGTCCCTGA 
CGCAATGGAA 
CCCAAGAAAT 
TGCAAACTTT 
GTACTTCTTA 
CTGTTGT CAG 
TTGAATGGAA 
CCGCTTGTAA 
TTGAAATAGG 
AAATAGTGTG 
GTAAGAGAAC 
AGCAGGAAAG 
AAGGCCCGAT 
ACTGCTTACT 
CCCCAATACC 
ACAGACAAAC 
GG C AGT C ATT 
TGCAGCACAG 
TTGCATCATG 
CCAGAAAGAG 
CCATCTACAC 
TAAAGAAAGC 
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18451 TGTGCAGCAG CTCCTCCCTG 

18501 TGGTTC CTGT GGCAGCTCCT 

18551 CAGGCCACCT GCTTGTTTCC 

18601 ACCATGTGCA GGGGGCCTCC 

18651 CATCTCTTAA TTACAGGGGC 

187 01 CAT AC C AAAA CAAGCACACG 
18 751 ACAGCTTTGC AGTTTGTCTG 

188 01 TCGTTACACA TCAGAACTCC 

188 51 GCATTATGAA TGGCACACTT 

189 01 TGGAAT CAT A TCATAGAATC 

18 951 ATGATCAT.CG AGTTTCAACC 

190 01 AGACCAGGCT GCCCAGAGCC 
19051 GATGGGGCAT CCACAACCTC 
19101 CTCTAACATC TTACCACAAC 
19151 TGATGGGTAG GAACTTCCTG 
192 01 TGGAAGGAAA TAATCATTTC 

192 51 CCTCAGCTTT CAAGTAGGCA 
19301 CACTGCTGTA AGTAACAGTT 

193 51 ATGGTGATAT TTATAGCAAC 
19401 ATTGCTACGG TTCTTGTAAC 

194 51 TGCTCCAGCA CCGGTTTTTG 

195 01 TACAGTAGTA TGAGT CAGTG 
19551 TGTGACACCG AGCATGTTTG 
19601 ATTTAAATTA GCTGAGGCAG 
19651 GTCTTGCCAA T C CATGG CAT 

197 01 AGAAAACCCA ACCCTGGCTT 
19751 CCCGTGACTT ATTTCCTGAG 

198 01 AACTCCTATA AGAGAGCTTA 

19 8 51 TGTTTATTTT CATTACTAGT 

199 01 AATGT CTAGT ATTCCTGCAA 
19951 TACCATGCAC AGATGGC AC C 
2 0001 GAGTTGTTGT GGAGTTGCTC 
2 0 051 CATTCATGGA GCTGGAGTCT 

2 0101 AGTGTTCAAG TGGCCTAGGC 

2 0151 ATGAAGTCCA ACAGAACCAA 

2 0201 CTCCCACTAC CAATACAAGC 

2 02 51 GAAGACCTGG GGGGTATTGG 

2 0301 GTACCCTCAC AGCCCAGAAA 

2 0351 AGTGTGGCCA GCACAGCAAG 

20401 GGTGAGGCCT CACCTGGAGT 

20451 GGAGAGACAT GGACCTGTTG 

205 01 TTCTATGGAA TGGGACACCT 

2 0551 GCTGTTCAGC CTGGAGAAGA 

20601 GGCAGTATCT AAAGGGCAGC 

2 0651 CAGGGTCTGG TGTGATAGGA 

2 0701 GAAGATTTAA GTTAGATATA 

2 0751 AGGCACTGGA ACCCAGCGTT 

208 01 AAGGCGAGGC TGGCTCAGGC 

2 0 851 CTACGCACTG CAGGGGAGTT 

2 0901 ACTGCAAAGA TT CTGT GAT T 

2 0951 CCTAGTCAAA AATTGACTAT 

210 01 CAAAACTGTA ATTAGGTCAA 
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AACACTGGCT ACGTTGTGAC ATCAGCCCTG 
GCGCTTCTGC AACTACATGA GTCTAGCTGG 
ATATCAGCAG CAGC CACGTG CACCATGTGC 
AGGCAGGTAA AAAAAACAAA CAAACAAAAA 
AGAGCAGGGC TGGATACGAA CAAACAAAAC 
TGTAAAGAGG AAAAAAAAAT AAAATAAATC 
TCTTCAGAGC AAATCAAGGC TGTGATTAAT 
AAGCAGGCTC AAGCTGAGCC GTTGCAACTG 
GAAAAACAGC CAGGTTGCTT TCCAGATTCA 
CR1 -GG 

ATAGAACGGC CTGGGTTGAA AAGGACCACA 
CCCCTGCTAC GTGCAGGGTC GCCAACCACC 
ACATCCAGCC TGGCCTTGAA TGCCTCCAGG 
CTTGGGCAAC CTGTTCCAAG ATGTTAGCTT 
ATAATAATGA AAGAATATTT AAAAAATCCG 
GCTGCAGCCT GTGCTCCAGC CCTCAGGTGG 
TAGTTGGAAT TTTCATTTTC TTTTTTTTTT 
AACAATTCAC TTGTCTTCCA GAGCTCAAAT 
TTCATTTGTC ATTTTTATTT CCTCTGTGAG 
ATTCTCGGTC CCTTGCTTGG ATGACTTGTG 
AGCATTGCCA GAACAGTAGC AAAAGGCAAC 
TAAGCCATTA CCTGTAGACA CTCATCTGCC 
GAAATTACTG TTTATAGTTT ACAGACCACA 
AAAGCAAAGT CCCTGCCTTG AATAGCTGAG 
CAGAGGAGGA GGGAGGGCAA GCAAAAGCAG 
GGTGC CTAGT GATAGGTCAC CAAGCAGGAA 
CATTATCAAC ATCAGGCCTA TGCTCAGGTG 
AAGTCTCAAA ACACGACCAA CACCTGTTTG 
GCGCCTGCTA TGATGCAGGT AGGATACTGA 
GCGTGACACA T C C AAAG AAA TTAGCTGTAA 
AAGAACGTAA CAGATCCTGC ACGTGGCAGG 
AACGGATGGA TGCTGGCTTC CTCACACGTT 
TGATGAAGGG GAGCAGCATT TGTGAGCATT 
CCTAAGCAAG GTAACGAATG CAAAGGTGGG 
CR1-GG 

AGGCTTGGGC AGTGAGCCCA GGTGAACCTC 
ATGCAAGATC TGGCCTCTGC ATTGAGGCAG 
TGGGAAAGGA CTGAGTGCAG CCCTGCTGAG 
TGGTTGGGAA GCTGGACATG AGCCAGCAAT 
GCCGACTGTA TCCTGGGCTG CAT CAAAAGT 
GGAGGTGCTC CTGCCCCTCT ACTCTGTGCT 
ACTGCATCCA GATGCGGAGT CCTCAGTACA 
GAGCGCATCC AGAAGAGGAC CACAGAAATG 
CTCTTACAAG AACAGGCTGA GAGAGCTGGG 
GAAGGCTGTG AGTTGACCTG ATAGCAACCT 
TACAGGAAAG AAGGGAACAG ACTTTTTAAG 
GAAGGGGAAA TGGTTTCAAG CTCAAAGAGG 
AGGATAAAAT TTTTTACAGT GAGGATGGTG 
GTGGTTGAAG CCCTGACCCC TGAGACTTTC 
CCTGGGCACC CTGATCTAGC TGTGGTGTCC 
GAACTAGATG GCCTTCAGAG GTCCCTTCCA 
CTAGTAAACA GAAAGCGTAC AGAACAGTGA 
CGGAAGGGCG TGTGGGTAGA GGTAGG C AGG 
AGAAAAATGA CAGGACAAGC TTATCTAATA 
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21051 
21101 
21151 
21201 
21251 
21301 
21351 
21401 
21451 
21501 
21551 
21601 
21651 
21701 
21751 
21801 
21851 
21901 
21951 
22001 
22051 
22101 
22151 
22201 
22251 
22301 
22351 
22401 
22451 
22501 
22551 
22601 
22651 
22701 
22751 
22801 
22851 
22901 
22951 
23001 
23051 
23101 
23151 
23201 
23251 
23301 
23351 
23401 
23451 
23501 
23551 
23601 
23651 
23701 



TTTGGGATGT 
TAAGAATTTT 
TAATTGAAAT 
AGAGCACAAT 
ATGTTTTGAT 
CAAACAGTCC 
CTCTAAGTCA 
ATGGAAGCCA 
TATGCAGCAA 
TAAAAAGCTC 
ATAGCGGAGC 
AGATCAAACA 
CTGCCAAAGA 
AAGCTGATTC 
CACTCATTAA 
ATAATAATAA 
GACAAAACCA 
ATTAATCAGG 
TTCTAACTCT 
GAAGAAACAT 
TAAAAACTAT 
TAGCAATAGC 
ACTAGTCACA 
TTAGAGGCTG 
CCAGGCAGCA 
TATTTATTTC 
ATAGAGCATA 
GATTAGCTCT 
GCACAGAAAT 
TGCACCACCC 
AATATTCAGC 
GGAGGAATTC 
ATTCTGTCAT 
TAATGGAATA 
ACACCTTGGG 
CTCATAGTTT 
ACCTCCCGCA 
CTGTGATAAT 
AGGTTATCCT 
TTCTAAAAAC 
CAAGTAGCGA 
AGATAAGGGA 
CCTACGACAA 
TAAGGACTGT 
GGCAGTAGCT 
TTCTGAAGTG 
AGATCAAGTT 
CGCTTGCCCA 
TCACATTTTC 
TGGAAAGAAC 
ATCAAAGAAT 
AAAAGTCTAA 
CAAGAACATA 
TTCACCCATC 



CAGTAGCCAA 
TTACACAGGT 
ATTATTGGTA 
AG CTAC AAC A 
AATAATAAAG 
ATCAAGTGCT 
GCCATGATTA 
ACCATGAAAC 
AAT CAAGGCA 
CCTGTCACGA 
AAATTTTCAT 
ATGGCTCAAC 
GCAAACTACG 
AAGGGTATCT 
AACAAAAGCA 
TAAAATCAAA 
GTACTTAAAC 
TGACGTGGGC 
ATGAAAATAT 
CTGTGAAGAA 
TTTACCCACA 
ACTAGAAGTT 
GCTGGCAGCA 
TGGTTATATA 
TTGTTTAGAG 
CACAGAAACT 
TTCCCAGCTA 
GCATTTTTGC 
CTGCACCAGT 
ACTGCCTCAC 
AAGCATTGAT 
AGTGACACCT 
GCTGCCTCTC 
CTGGTAGGAA 
CTGATGATAT 
AGGTTAACCC 
GCCATTTAGC 
ATTTCAACTT 
TAGAGCTGCA 
AACTTGAAAC 
GATGCAAGTT 
TTACACAGTT 
CAACAAAGAA 
CATATAGGAC 
TCAATCCTCC 
TTCACCAGAG 
TACAAACCAC 
CAAAGGTCCA 
CAGTGCGAGA 
ATTTCATTAT 
TTGTATTTAA 
ACAAAAGTGA 
ACTGATGTTG 
CTATGATTTA 



9/79 

ATGCCAGTAC 
AATTCTGACA 
AAACGGTATT 
TAGACTACCC 
TAGCTGTTGG 
TGCAATAGGT 
CTAGGATGAA 
CACGGTCATC 
TTCCTGCATA 
TAATCTTCAG 
TAGGAAGACA 
AGAATATTTA 
ATCAGGTGCA 
GCCCAGACGA 
GTCAACTCAA 
ACCAAACAAA 
CAGGGAAGGG 
AAAGATACAG 
GTGTTGAGAA 
CAAAGCAAGC 
GAACATAATT 
AGGAAAAAAA 
TGAAAGATAT 
AAGCAAATAT 
TACATAGGTT 
ACAACTGATA 
CAAAACACTA 
CAGCGATGAG 
GGAGGTGCCC 
CGCGCTCACA 
GAATGTCAAG 
CTGCTTCATT 
TGCTGCCATC 
GGTTCAACCA 
ACTAAAATAA 
TGAAAACTGA 
TACATCTTCT 
ATCCTCATTG 
GTTCTAACCG 
ACCAAACATT 
TAAAACATAC 
AGGTTTAGGG 
ACGCATTTCC 
TGTTATAAAG 
AGGTGAGAGC 
G C AAC AG AG C 
AGTAACTTGC 
CAAAAAGATT 
C CGAAGT AAA 
GTTAGCAAAC 
GTGTTTTGTT 
TAGTTTTGAA 
TACATTTTAC 
TCGAGTACGC 



AGAGGATGAA 
ACAGAGAATT 
TTTAAAGAAA 
GCTCAAGAAT 
AAAAGCAGCA 
TATGTAAGTT 
TCTGGTTAAG 
ATTCTGGAGG 
TTTCAATAAT 
AAATAATGTG 
ACTAAATAAA 
AAGCAGTTTC 
GCTGACTGAT 
CGTGTCGACA 
CTCTGAAGGC 
CCTTACCAAT 
ACAGAACTCT 
CCAGAGAATT 
AAT C CACTGT 
AAGCACAGAC 
TTTCATACTG 
CGTAGGCCAA 
GCAAGTAATT 
AAC CTTTAAT 
GCTCTGAAAG 
CGAAGAGCAT 
CTTTTCAACT 
TGAGAGCCTG 
CACCGTCACT 
TCCACTGCTT 
AGGTGTAATT 
TGCGCTTCCA 
TGTCGCACAG 
CTACCGCCAT 
AT ACTAC CTT 
AGTAAATGAC 
GGGATAACTT 
TAGAAAAACA 
GCAGTGTTTA 
AATGTTTCCA 
AGTATATTTT 
AAACCCATAG 
TGAGAATTTT 
GTCAATTAAC 
CCTGCCAGTG 
AAAG AAT CCT 
ATCTACACTT 
TGCAGCCCCC 
GCTGCAAAAC 
AAAAGCT CAG 
TAGCTGTCAA 
AGTAACACTG 
TTCATTTAAG 
AAAATATGTA 



CAGCAACCAT 

TGGGGAGTAA 

AATCAAGGTG 

AGAAGGAGCA 

AAATTGGAAG 

GTGTGAATGG 

ACAAACATGT 

AAGGAAGAAT 

TCAGAGCTAT 

AAAAAAATAC 

CACAAAAAGT 

TTTGCTTCAG 

AGGAG CACAA 

TGTTCTGCTC 

TAGTAGTTGA 

CT CTAAGAC A 

GGATTCAGAG 

TAATGAGTTT 

TAGT C AATGG 

ACAACTGATT 

CAGTCAGAGG 

GTAGTCAAAG 

TATCCAGTGC 

CAACTATAAA 

TAACGCCTCC 

AACAACACTG 

CAGTCATCAC 

CATGCTACGC 

GGTGCTGAAA 

GGTCTCCATA 

TTTTCTGTGT 

GCCAAATGCC 

CCAGAGCACA 

AC C AC C AATG 

TGGGGTAGCA 

ACCTCCCCCT 

AG CTAGGAAT 

G CAGAAC AAA 

TTTGGAATAT 

TTTCCATGAG 

TCTACGTTAA 

GACAAACTGA 

AAGATTGCCA 

AAGTAATTCA 

CGTGGCTCGC 

GCTGCAACTA 

GAATTTCCCC 

TGAAT CACAT 

TGAATGACTT 

CACCTTGCAG 

ACGTAGAAAC 

AAGAAATACT 

TACAG CAAAT 

CATAGAGGAA 
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23 751 ACCAAAACCC ATAAAAAGAC AATCATCTAT GTGCATATGC GCATGTAACA 
23 801 TATGCACATG AAATGTGCAA TTTTCTTTTA ATGCAAGTTA AACAAAGCAT 
23 851 ATGCACAACA GAGTTGCACA AC C ATTAC AG AACAGAGTGT TCTGGGTATC 
23 901 CTCATGATGT TTCGCATCTA CAGCCAGTGC AAACTTACAA GGCACAAACT 

23 951 CAGTGCTGAC ACCGTAGTGT TGTAAGTTCA GGCACATTTC AATTTGTAGT 

24 0 01 TCTTAAAGAT AATAATCAAC AGAAGTGCTA CTTCTGTACT AAAGTGCCAG 
24 051 CCTCTTCCCA AAGATTAAGC ATTAAGTTGA TGTAACCTGT ACACAGTAAT 
24101 GATCAGCGGC GTTCGGATTT AACCTAACCT AT C AC TG C AA GGTCTGTGGC 
24151 TAT AT CGTGC TATGCGCTCC ACACCTCTGA GGGTATGCTG CTTCCCAAAA 
24 2 01 TGCCTCCCTC ACACTCTTCA AAGACTACCC ATACCTCGCC AGCCTTGACG 

242 51 CGTGGACTCT TACAGGTTAC TACTCAATGC TTTTTCCTAA CCTTAGCCAA 

243 01 ACCTCTGATA AAACCAGACT TAAAAAATCA GCCATCGGGA AATCTTTCGC 
24351 ACACTTGCAT TTAACAAACC TTTGCTCAAT TGCATAGTGA CATGTGTATC 
24401 AG CT AGGAAA GAATTAAAAA CAAAAGCTTG CTGCTTAAGG CAAAAATTTT 
24451 TAACAC AG CA CAGCAGAAAA AGC CAAATAC CGGGTTCATC AGTATTTAAA 
24 501 CAAAGCACTG GCTCATACAG TCTTCTCCTC ACAGTGTTTT CTTCCTTACT 
24 551 TTCACAGCAA ACACACACAG TATGCTCAAT TAGCAAATTT TGTTGCATTT 
24 601 CTCTAAACGG AGTGATTAAC ACATAGGCTG ACTGCTACTG AAAACACCTG 
24 651 ACAAATCGCT TCTCTTGCAC CCTCAAAAAA GGGTTTCTTT TTGAGCCTAC 
24 7 01 CAGAAGTTGA AAACCCGCTT GCGCCCAGGT CTAATATAAC AGCTAAAACT 
24 751 GATCATTTAA AAATTACAAA TATTTACCAT GAGTTGCCAC ATCACTCTGC 
24 8 01 TAAAATTGTG TTTTCCGT AT TATTT TCCAA TAGAAGACAT TTAATAGACA 
24 851 TCTGAAGAAA ACAATACAAT ATAAAAGCGT AAGGGTCTTT GCAAACAGAT 
24 9 01 CTTCTATTCC TTCTGCAAAG TAAGAAAGGA GAGAGTTTAT TGGCATTTAT 

24 951 TTGCAGTGCC AT CGATAAAG ACACGAGAAT ACTTAAGAAA GCAAAAAGTT 

R gene exon 9 

25 001 CTAGTGATCC ACAGACATCT TTGGCTTAGC CTTCCTGACC AAAGTCTTCT 
25 051 GTAAACTTCT TTAACTTCTC CAGGTCCTGC TCATTAACTG TTGGCTTTGT 
25101 GCTAGCTAGC GACCTGAGCA TATCGGCCTG TGAGAAAGAA AGCAAGAAAA 
25151 CATGCATTCA GAAAACGTAC CGCTGCTAAC AGTATTGCTG TGAAGAAAAG 
25201 TAAGCTTTGA AAAGCCTTTT AAAACAAGTT GACTGTAGGA ACTCTTATTG 

252 51 AAACAAAACT TCGAGTAAGC CTGAACATTT CTGCACGTGG AC C ACTTTTT 

253 01 AACCTCCTGA CGATAGACAA TTAGTGAGTT TTTACAGGAC TTAAGCCACA 
253 51 ATCTGAGGTT CAGCTTTAAA ACAATTCATC CATT CAACAA GTGTTATCTA 
25401 CCACTGCTTA CTGCAACAAA CTGAGCTTCC CATCTTACAG ATTCGTATTC 
25451 CAATTCACTT TTAAGGACAT CAGGTTGAAG TGGAAAAC CA TCACACGTTC 
25501 CCACATATTC CAATGCCCAC CAACACAGAA TACTTCATGA TTGATCTCCA 
25551 GCAAAGTTTT ACTGCTCATG ACTGCTAACT TCTGTTTCTT C AG CT CAGTC 
25601 AGTTTTGTAT ATTTACATTT GGCTACTAGA AAATGGAGTT CAGAAAAAAA 
25651 AACCACAGAG GTATGAACTC AAATTCAGCA GTTAAGAAAC CTTATTAAAA 
25701 AAAAACGTAT AT AAAAAGT C CTGGCCAAAG GCAAAGCGAG GAGCTGCTCA 
25751 ACACCTCACG TTACTATAAA AGCACAGGGT TAAGTTAAAG T C AGC AT CAT 
258 01 GATTTTCTAG GCTTTCTCAT CTCATCGTAC TACAGACATC CTACTTAGAA 
25 851 AGAATT CAAG TCTGATCTTT TTAATGACAA GAACTGATTC TGGACTCTGA 
25901 AATAAGTCCC TGTGCAACTG TAG C ACAT C A GAGTCTACCT T C CATTAGAA 

25 951 GCACTGAAGG AATTGTATTT AATTCCAGGA AAGACTGATG AAAAATCCAC 

26 001 TTAGTTTACA CAGGCAGAAG TTTTAAGGCA GGCCTGCACT TGCTTGCATC 
26 0 51 TTTTCATGCC TCCTCCATGT GCAAATATGC AGATATTT CT CTCCTCAAAC 
2 6101 TAGTGATGGT TACATGTGCA AAGCAGTGCA CTCTACTTTA GAGGGTTTTT 
26151 GATCCCTATG CAACACACCT TCCTTTCATT CATTACAGAA ACGTTTGCAC 
262 01 ACAGGAATGG CCATCAGCAC AGATCTGATA TCGAGTCCTT CCTTCAGACA 
26251 ATG CAATTAC ATTCAGAACC TTTTGCTGCT TGAGGGTAAA ATATACGAGT 
26301 GCTCAATGAT TTGTAACCTT TTAAACAATG TATTTAAACT TCAATTTCTC 
2 63 51 TCAAATATGA TGTTTTGGTC TGTAGACAGA AGCAAATATT TTAACATATA 
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26401 CAAAAAATTC CAGCTGAATG TTAGCAAGAG CTGGCTGCAT. CAT CTGTGAT 
26451 GAAGTATAAT CCAAACTACC ATTGCATCCA CCAGCTTTTT ACATTGCATT 
26501 GGTTATGCTT GCATTTCTTT TGTGGGCAAA ATTTACCTAC AGCATGTTAT 
26551 TCCCAGTTTA CACTGAATAT AATTTCCCAC TTCTCGATGT CAATAATAAT 
266 01 GCTACAGAGC AACAGGAAAG TAACATATCG TGGGGCAGGG ATTCTGAAGG 
26651 TTTTAAATGA ATAAAAGAAA AATTAAAGAA GGGAGGAAGA TTCAGGTGCT 
26701 GTCTATACTG CATGCCACTA GACAATAATA AATGCTTATC AGGGATGGAG 
26751 AGCTGGCTCG CTGATAAGCA TGTTGTATTG TCATGCTGTG TGTTGCGATT 
26801 AAAATGTCAT CCAGTATGTC CAAGCATGTC TAAAAACAAA GGGCTCAGCC 

26 851 AATTGCCTTG CATGCTGGCT CTAAAATGTC TTGAGTATTT TCAGGGTTCT 
2 6901 GCAAAGCAAG AAACACCACC AAAAAATAAA AAAATAAAAA CAAATACCCA 

R gene exon 8 

26951 CCATGGAAAC TTTAGGCTCC AGTAATTTAT CCCCTGGAAC ATCCATCCAT 
270 01 GTCATTTCTT CAGCTTCAGG AT C AC CTGGA GAGCAAGGAG TGAACAAATC 

27 0 51 T AC CAT GAT A TTTGGATTCG T C AC T GAT GG TCCTTTTACC TAATAAATGA 
2 7101 ATACATAAAT AAATAAAATA AACAAACTGA AGCTGAACAT CTTTAGAGCA 
27151 AAAGCATACT CTTAATTTTC TGTACATGCC CCACCCGTTT GGAGTTGTGT 
27201 AGTGAAGTGG AATTGTGTAA AGGTGCTGGC ATCGTTCACT TTGAAAACGC 
27251 ACAGCAGTAG TCAGATACTT GAACT CAT AC CATGTCAGAA CCAATGAGCC 
27301 TTTAAGGTAG GAATGCTTGT AGAAAGCTAA TGTGCCAGGT CTACTGTTTG 
273 51 GAGAAGAC C A CTCTCTTCTT AGTCCTCAGT CACTTTGGGA GTCCATTCAC 
2 7401 C AC TGGTTAA CATTTCTAAA AAATTCTCAG TAGTTATTAC TGACTGACCC 
2 7451 TCAAGTTGGG CTGCCATGGG TGTCCTTTTA AGCTTCCACT CACTGCACTA 
2 75 01 AAAAGTTCCG GGCACCTTTT CTGACACAAT CTCTAACAGC ACTTGATAGA 
27 551 AGATGGGGCC ATCTAGTGGA GGAACAGAAA CCATCCCTTC TTCCAGATAC 
276 01 ATAGACAGAA CCTGAAAAGC TCCATCAGCT GCCTCTTATC TTTTTGCAAT 
2 7651 GCATATCTCA GACCTGTAGT TCTACCATCC TTCCTTTGTC AGTCACTGAA 
2 7701 GT AT C AC AC A TCCCCATGAA CACAGAACAC ATGCAAAGGC GAAAAAAGAA 
27 751 CTGCTTTTAA CAGCAGAGAA CTGGATTTGC TGTTTCAATC TGCTTTTAAA 
27801 GCACAGCGAA GAAAAGCATG GATTATAATA CTGGAAACTC AACTTGGACA 
2 78 51 AACCGCTATC AATAGGCTGG AACAAGCAAT GGGTTACAGT GAGTTACAGA 

CR1 -GG 

2 79 01 AAT TGAGCAA AACG CTACAA ACAGGAGGC A GGGGCAGATG GCGATTGGGA 
2 7951 CAAGGGGGAA TAGTTTAAAC CAACAGAGGG GAGATGTAGG TGAGATGTTA 



28001 


GGAGGAAACT 


TCTTACTCAG 


AGGGCAGAGA 


GGCGCTGGCA 


CAGCTGCCCA 


28051 


GAGAAGCTGT 


GGTG C C C CAT 


CCCTGGAGGC 


GCCCAAGGCC 


AGGTTGGATG 


28101 


GGGCCCCAGG 


CAGCCTCAGC 


TGGTGGGGGG 


CAGCCCTCAC 


CATGGCATGG 


28151 


GGTTGGAGCT 


GGGTGGGCTT 


TGAGGTCCCT 


TCCAACCCCC 


AACCATCCCA 



2 8201 TGATTCTA TT TAACTGGGAC AAACTGCTAC TATGGAAATA GTTAATAAAG 
2 82 51 CAAAGGTTTT TCTTATAAAA ATAAGAATCT GCATCCAATT AAAGCACAAA 
2 83 01 CAAAACAAGT GGAATAGACT TGC AT CAGAA CACTCAAAGC ACGGTAGGCT 
2 83 51 TTTTTTCCTT TTTGG CAAAA GAGGTAAGAA TTGCCTTTGG CTGCTCTGCA 
284 01 AACTGTGGTA ACTGAGATTA TTTCATTGTT CTGTGGCAGG CTGAGGCACG 
2 8451 CCTCAGATGT CTGCAAATTT CAATGAAAGG CTAAAATGTG ACAACCCATT 
2 85 01 GGC CAGAAAT GCCATCATTG TATAAAAACA ACAATGGATA AATACTTCAG 
2 8 551 GCATCACTGC TTAAGGGAAG GAATAACCCA GAAAATCCCT GAT AT AT C AA 

286 01 AATAGCCGCT TATTTTTTAA GCAAATACAG TTTACAACAG CTCAAAATAC 
2 86 51 TGTTT CAAAA TGTTCTTTGA TTTTAAACTG GGAAAAGTTC AT C AAAATAC 
2 87 01 CT AC CAAATA TTCTTCCTCA CCACCAAAAT TACAGACTGC TGGCGTATTT 

287 51 TAACAAGTTG ATAAGGCTTC CTCACTGCAA GCACTGGAAC TTTAACAGAT 

288 01 CTCTTACATT CTGAAC C ATA TTGTATTTAA GCGTTCCTTT CCCTTGGTGT 
2 8 851 CTTAAGCTGA ATGTGTTCCT TACAATTACA TGGAGAAAAG TGCCCACCTT 
2 89 01 CAGTTCACAC TGACTCTAGC TGTTCAGCTG AGGGCTCTGG ATGAGTTACT 
2 8951 GGTAAAAAAC TAAGAAACTG T CAT C AT AAC TCATGAGCAA CAACTGCTGC 
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29001 CAACACAAGT TGCGTGTATG ACACGCAGAG CAATAAAATG AAAGCTCTGA 
29051 AAGCTTCCCT TTCCAGAGTC AAAAGTCCCT GCAGATAACA AGAATCCACC 
29101 TTCACCTGAA GTTTGTGAAT TTCTGTGAAA ACAAAGTCTG CAGTACAAAT 
29151 GTAAACAGAT TATTTTAGTT TCGCTCTCTA AAACCAAAAC AACAGCAAGA 
29201 AAAAACTAGA CAAGAAAAAT ACTATCATGT TATTTATAAA ATGTAGGCGA 
29251 AACTCCAAGA TAAGCAAAAA AAAAAAAGTC TTATCTATCT ATAGTTACAC 
29301 TCTTTTTAGA CAT C AAC TAA GTGTAAAGTA GTTTTCACTC T AC AG CAGCA 
29351 TCCATAAGAT GTTCCTTGCT GCCCCAGCAA TGACAACGAC CTTACTCAGC 
29401 CGTCTTGCAT CTTAACTACT GTGACAAGTA ACATTAGGGG ATTCAATTTT 
29451 TTACTGGAAT CTTAGGATAA TCTTAATTTT ACAGTTTGAA GGACATCCTG 
29501 AGCAAACAGT TGTGCAGTTG TAATTCCTCT GTTCCCACGT AGATAAGGAA 
29551 TACGTTTATT TACACACATG CGCTAGAAAA ACAATTACGT AATTTGATAT 
296 01 AGAAGAAGAG CACCACTGTA AGACTCCGAT TTAAGTTGAA CTCCAAACCG 
29651 AATGCTTTTA ACAGCAGTTA TAGACGTGAA GATTGATTAG AGCTTGGATT 
29701 ACACAACATG AATAC CTAGA GATGAGGTGC ATCAACTTAT GGCAGGAGTA 
29751 CTCCTTTGGT AGGTAATGAA GAA.CAGCATA CACACATCTG TAAGCACACG 

298 01 GT ATT AC C C C AAAC CGAACT TGGCTTACTT ACAACAAGTT TTCAGATCAA 
29 8 51 GTTAATTCTC AGAGTTGAAG CAATATGAAA AACGTTTTGT TTTTACTTAC 

R gene exon 7 

2 9901 TTTTTTAAAG TGAGTAGCTG ATTGCACTTT TCTAACAGGT TGCATCAGTG 

299 51 CATCGCGTAC AATGATGCTT AT AT CTGC AC CAGAATAGCC ATCGGTTCTT 

3 0001 TTCCCAAGCT CCCGATAATC TGCTTCTGTT AGGAGATTGG GAGTCGACCC 
3 0 051 GAGGTGA&GT TTGAACATGG CAGCCCTGGC ATGGTCTTCA GGTAAAGGAA 
3 0101 TATAAATTCG CTTCTCAAAC CTGGTTTCCA AAAGATAAAA GCACTGGCTC 
3 0151 ACGCAGGTGC ACGATGGAAA GAAGTTTATG CAAATCAGTA TATACTTTGT 
3 0201 TTGTAAATGA AACTGCTTTT TTCTTATGTA TTATAAATGT T T AAAAA.TAT 
3 02 51 ATATCTCAGA TATTCTGCAG CCTGTTCTCA TAAGTAATAC CATGGCTATC 
3 03 01 ATAAGCTAAC ATCTACAATT TAACAACGAC TTCCTTTTTA TGACAGAAAG 
3 03 51 TCTCTTCAGA CTGTAGTTTC TCCAGGTTCA CTC CAGAGAA GTTTGTTTTA 
3 0401 AAAGAAAATA ACTGAAGGAA AAGGAGTCTT TTAGTTTTTA AGTACATCTG 
3 0451 AACAGTTTTC ATAGAAT CTT AGAATCGCTA AGGTTGGAAA AGACCCACAG 

CR1-L 

3 05 01 GAT CAT C C AG TCCAACTATT CACCCATCGC CAACGGTTCT C ACTAAACCA 
30551 TGTCCCTCAA CACAACATCC AAACATTCCT TGAACACCTC CAGGCTCGGT 
3 06 01 GACTCCACCA CCTCTCTGGG CAGCCCATTC CAGTGC CTGA TCACT CTTTC 
3 06 51 AGAGAAATAG TAGTGGTTTT TCACACTCAA AGAAAGAGCT GCCCGATAAC 
3 07 01 ACGTT CACAC AACCAGTTTC TAAAGTTTGT AAGTAGAGAA CGTTGTAGTT 
3 0 751 GGAAACGAAT TTGAAGTCTT ACTCTCAATA TAGTTGTTGG TAGGAATGGT 
308 01 TGATACTTGC GGTGCTTCCT TTGAAGCATC TGTTCTCAAA GAGAGGACGA 
3 0 851 CCTCCCATCA GGGAAATAGG ACCGACTCCA AGTTCTGTAG AACACTATTA 
3 0901 ACTTCCTATA GGTAAGTGGG CCCAAGCCAT GAAAAATTAA TTCTGTTACT 
3 0951 GCCACGCTCT ACAAGCTCCT TTAAGTTTTT CGGACAAGAA TGAGAGATAC 
310 01 TCGTTCACAC TGCAAAGAAT GACTTGAAAT GTTAAGTACC ACATTCGCCT 
31051 CTTATTCCTT GTATGAAACT ACACATGCAC AGGATGGAAG CGACCTCTGG 
31101 AGGCCACATG GTTTAAACTC CCCAGTCAAA GCACGGTCGA GTTGAATTAA 
31151 GT AC AT CG AT AAAATGACAC TGTCACCAAA AAGGATTGTT TCTTTAGCCT 

312 01 ACAAAAATTA CCATTATACA GGTTGTATCA T CAT CAC AAC ATAATCACAT 
31251 TTGTCACGTA ACTGTGTTTG TCCTTTGCTG CTCTGCAACT GAAAGATCCA 

313 01 GCTAATCAGA TACAGATACA AACGT CAT C C CATTAGAGAA AGGCAGTTGA 

313 51 AACGTACACT GAAAGAT CAC ACAAACTGTG TGACCAGTAC AGCAAAAACA 

314 01 ATGCTTCTGC ATTACTTAAA TTCTGTGAAA TTACTCAAGC TATCCAAGGG 
31451 TTTGCTAAAG TTGAAAACGA TAGCTCTGCT GCCTCTTACC CTTCTGACTT 
31501 GCTTATGTTG TACCTTGCCC CCCATGCTCA CCAGGAGACC AGTCAGCAAC 
31551 GAAACACAAG TTTTTTGCTT AGTCAAGTGG AATTAGCTGA CTAAGAGATC 
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316 01 AGACAGACTA CAAGATATAC ATAAGAGAGA ACAATCCACC ACTTAAGTGA 
31651 AGGGGATATT TGACTCAGTC CACCTCATGA GACATGCCTG CAAGAATCAA 
31701 GTGGAT C ACT CACTCAAATA GCCTCAGGAT GAACCCTCAC AATAGTTGCA 
31751 AATTTCTTAG CATAAACATG AATACATCAA T C AT AGGCCA ACATACCTTC 

R gene exon 6 

318 01 TCCTGATAGC AGAATCCAAA ACCCAGGGTA TGTTTGTTGC TCCTAAGACC 
31851 AATATTCCTT CATTATCAAC ACCAACCCCT ATAC CAAGGA AGAAATCATT 

319 01 TCACCATTTA GAAAATAAAC AGAGACTGCC TGATAATGTT TTAGAACATT 
31951 TACAAAACGC AAGGGGGTAA AGCTGCACAT CTTTTCACAT GTAAGCAATG 
32001 CATTTTATGC GTAGCTGAAC TCCTTTGATT CTGAAAACTA TTAAACTTAC 

R gene exon 5 

3 2 051 CTTGCATCTG GACTAGAAAT TCCGTTTTAA TCCGTCTAGC AGCCTCGCTT 
3 2101 TCATTTTCAC TTCTTGACCC ACATAGTGAA TCTATCTCAT CAATGAAGAT 
3 2151 AATAGAGGGC TTGTTTTCTC TGGCAAGCTG GAATAGGTTT TTCACTAATC 

3 22 01 TTAAAAAAGG AAACAGCTGC AGTTATCTTA TTGTACACAC AAGCAAAAAC 
3 2251 ATGCAACTTT GGATTATGAT ACAGTGACTT TGTTAAGAAA AAGCTAAAAG 
3 23 01 TAAAAAATAA AATGAATCCC ACATAAGATA TTAACAAAGC TACTCAAAGA 
3 23 51 TACAACATCC CTTCAGAACT ACTAACACAG CATTAGGCTG AGATGCTGAG 
3 2401 TGAGATACCA CAGAATAAGG TAACTTTAGG CTTCCTAGTC TTGTTAACAC 
3 24 51 ATCTCATTGT AACATG CAGA GTGGATATAT CAAAGGCGCT CATCACTTCC 
3 2501 AACCCATATA TGCCCATCTT TTATGTCTTC AAGATTTTGT T TGAAAAC AG 
3 2551 AATGTAGAAA AAAAACCTTC ACACAGAGGA AGAAACAACA TGTATTATCT 
3 2601 GCAGGGCTAC TGCAACAGAT GAGCCAGAAG GTGACAAGAA TCAAAGTACC 
3 2651 CCAACACTTC AGACCACTTT GTTGTACAAT CACAGCTGGG TTCAGAAGGG 

32 701 CATTGATCAC CATTGTGCTG CTAATACCTT TGTCCAAACT AGTTTTAAAA 
3 2 751 ACAGTCTTGA GTGCTGAAGC TGCTGTAGCA CAAAATACAG TGCATTATGG 
32801 TACTTTTACC TGACACTGCA CTGAAGCAAA GAAAC AT CTA AGGTTTGCTT 
3 2 851 TAACAAGACA CATGAACCTT CCTTCCATTT AATTTCTTTA GAGTGTCCTA 
3 2901 TCTAGCTCTG AAAAATTAAT TTCCTCTTGA TAATATTTTC CTGGAACTCT 

R gene exon 4 

3 2951 GGAAACTCCA ACTTACTTCT CACTCTCTCC TAACCACTTT GAGACCAGGT 

33 001 CAGAGGAAGA TACTGAGAAG AATGTGGAAT TGTTCGCTTC CGTTGCAACA 
33 051 GCTTTTGCTA GATACGACTT TCCTGTTCCT GGAGGTCCAA ATAGAAGAAT 
33101 CCCTCTCCAA GGTGTTCTCT TCCCTGCAAG AAAGAAATCA G CT AT CATC A 
3 3151 AAATGCTGTA TCAAGAGCAA GTCTATCTTT CTGATGAAGC CTCCCTAATG 
33 2 01 TACTAAGTTT TCTGTATGTA CCTAAGAAAC ACCTGTCAGA TCGATCATTT 
33 251 ACAGCTCAGC TGGAGCCTCT GATATAGCAG CATAATGCTC TTCTCAGACT 
33 3 01 CCGCTTACAC TACTCACTTC AACAG CAGT A TTTAGAATGG GAAATAAATG 

R gene exon 3 

333 51 CTGTAATACT GACCTGTGAA CAAGTGTGGA AATTTAATGG GCAAGATAAC 

334 01 TGCTTCTTTA AGAGCTTCTT TGGCACCTTC AAGGCCAGCA ACATCACTCC 
3 3451 ATTTCACATT TGGTCGCTCC ATAACAATGG CAC CTATAAG AAAAGATTGG 
33 5 01 ATAAATCACT G AT ACGTAT T TTTCCACTGT TTGCTTACCA TATATTTGAA 
33 551 AAAAGAAATC CACGTGTATG TTTACATTAA ATAAAAACGA GCCATTTCCA 
33 601 CACAGATTTC AGCATCAAAC AGTGCTACTC AAATGGATAT TATTTCTACA 
3 3651 GAGATTTGGC AATCTTTTTT TCTTTAACCA CAATAAACCA TCAATAAGCA 
33 7 01 GAGAGTTGTT AGAAGTTCTG CAGTGTGCAA ACTAACTCTG CAACTGCGCA 
3 3 751 GAAAACATAC CAATGGCAGA TACAGAAGAG TACACTTCCT AAAAAGAGAT 
3 38 01 CAACATGACG TACACCCTGA TGAAGCAGGC CCACTACAGT AGGATGCACA 
3 3 851 GGAAAGCATG AGCAAAC AC C CTG CTGTG AG C ACT C AGTGT AAAAAGAAAG 
33 9 01 CCTGGAGTAG AGACCAACAT CAAT CTGTAT TGCATCCAAA CCAGAAGAGG 

33 951 CAAAAAAGTG T CT CACTAAG TTGCAGAAAA TGTGAACAGT TCACACAAGA 

34 0 01 CGGATTACTG TGGAGAGAGT AAATATGTGC ACTTTTTATT TTCCCCAATA 
34 0 51 TGTCACCATT ACAAAGGAAA AT CATGG AAT GGTGGAGGGT GATGGAGGCC 
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34101 CAGCCTGGGG CCCCAATACA 
34151 AGCGGTTGTG ATGT CAGCAA 

342 01 CTGTACAGAT TTGGGAT CTG 
34 251 TTCTACTGAG CATAGCTGCG 

343 01 TTGTGCTGTG GGGCTGCTGG 
343 51 CACCAACGTT TCCCCCAGCC 
34401 GGGTGCACTT GTAGCAGCAG 
34451 GCGGCTGGTG GGGAAGTGGG 
345 01 CAGCTGCCTG CCCATGGGAC 
34 551 GTT CGTAACA CCTTCTGCCT 
34 601 CGACCTCAGC TCTCCCAGCC 
34 651 ATCTAACTGG ACCTGCCCCA 
34 7 01 ATGTGACCCC CTGCAAACAC 
34 751 GCAAACAAGA AGCCGAGTTG 
34 8 01 CCGATACTCG GTGAGGTCGG 
34 851 AGCCCGCAGC ACACTCTGAT 
34 9 01 GAGCCGGTGC TCATCCCACC 

34 951 TGCCTTCTTC AAAGAACATC 

35 001 TGCAGGAGGA GATCCAGGAG 
3 5 051 GTGGGACAGT GGACCACTGT 
35101 GGAGGCCTTG ATGCGGGAGC 
35151 CCTTTGTAAG GCAGCTCATC 
3 52 01 ATCCGCCGCC AGCTCGACCA 
3 5251 GGACAGCCCC GCAGCCAGCC 
3 5301 CTGCCTCGGG CCTGGGCCAC 
3 5351 GAGCTGCCCG GCAGCTCTAC 
3 5401 CGCGCCCTCA GCGGAGGAGT 
3 5451 CGGGCCCCTC CGCCCAGGGC 
3 55 01 CCCCCGAAGA GGAAGGC CCG 
3 5551 CAAAAGGCGG CCCCGGCGGC 
3 56 01 AGCACAGCGC CAGCGGGCTG 
3 5 651 CTTGCTGGCA GAATAAACAT 
3 57 01 GTTATTGACA AGACTCTTGC 
3 5 751 TCTCTTCCGG TCCTAGAGGA 
358 01 ATTCAGCATT CATGACAGTA 
3 5851 CGAAGATGGA CCCCTCATGT 
3 59 01 TGGCTAAATT CCTCTCCTAT 
3 5951 TTGATTTTAG CTGCTGAGCT 
36001 TAGTCAGTCT GAAGATAAAA 
3 6 051 CAGCTCACAT CTGAGCTAGT 
3 6101 AAGACAAATC CAAAAGG CAA 
3 6151 CTACCCCAAT GCTTATCTTT 
3 62 01 GCCATTTTTC TCTACAAAGG 
3 6251 TAGCTATTTT CTTGTCACTT 
3 6301 GGTTGATGGC CATTTTCTAC 
3 63 51 CAAATAAAGC AAGTGACATT 
3 6401 TTTTACTTTT CCTGCACGCC 
3 64 51 GCTTGCAATC TATATACAGG 
3 65 01 TACTTTTGCT AGGTTGATTA 
3 6551 T C ACT AAATT AATTAGTAGG 
3 6601 TATCAGTTAT GTGGACCTGG 
3 6651 TAAGACATTA CCCATACCAG 
3 67 01 CTTCCTCATT TTGTAGGTGT 
3 6751 AGCGCAAAAG GTCTTTAAAT 
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TGCAGCAATG GACAGTGAGG TCACCGACCA 
TGGAAATGAC TGTGTCCTCG CTAGCCCTCA 
GCAGAGGCCA GCGTGTACTT GTACCTGGAC 
AGACTCGGAG CACTGAGCGA GTTGGTTGAG 
CAGCAGTTCT TGGTGCCCAC CCCACAGTAC 
CTGCCTGTCT CAGGCAGCTG GGGCCACACA 
AGGTGAGTGG TGCAGGACAT GGCCTCTGCG 
AGGGTTTGCT GCTGAGGGAC CAGGACATCA 
GAGTGACCAT GGCCTCTCTC TCTCTTTGCA 
GCTGCAGTAC CTGTGAGGGG AGCAGCTTCC 
CACCGCACAG CCCGGGGCCA TGGACGTGCC 
TCTGCGGGCA AATT CGGGAG GATGTCACCT 
CAGCTTTGCT ACGGCTGTGC CATCTGGTGG 
TGCCGTATGC GGGCACCAAA TCACCACCAT 
ACGACGACTA CCTCGAGTGT GCTGTCCCGC 
GATAGCCTGC AGGATGAG C A GGGGCCTGCA 
TGAGCACAAC TTCCCTGCCG AGGT CTGGGC 
AGGGAGACCT CGAGCCCCTG CTCCACTGGC 
GCGTCCAGCA GTGACTGGTG GGAGGTGGAA 
CAACTTCCTC TGCGAGCACG GCCTGGACGA 
TGCAGCCGAT CACTAACGGC GATGTGCTGC 
AGCACCGCTA CAGCCCTGTA CGGCCCAGCG 
CCAGGAAGGC CGTGCTGCAG GACAGCGGGA 
CCAGCACCAC CACCTCCCAT CAGGAGCCTC 
TCCACCAGCC CCGCAGGGCC CAGCACCGAG 
TGGGGGACCC GGGCACCCCA GCACCACCAC 
CGCAGGAGGA GCCATGGCAG GCGGTGGCAG 
AGGGACCGCT CGTGTGGGGG GCCCCGGCGC 
CAGCAGCCCC CAGGCCTCGC CCCCACCTCC 
GGCGCTAGGC TGGCACCGCA CTGCCGTCAG 
GGAGGCCAAC ATCTACCTCT CGGCCTGCTG 
CAGTTAAAAC AAAGAAGAAA ATGTCTCTGT 
TGTTGCTGTC CCTACCCATG CTGCTTTCTC 
GAGAAATGCA ACTTTATTTC CAC CAT CAT A 
CTAACAAAGC ACACATAGGC TCCAAAAAGC 
TGCTCTAATC ATAATCCAAC CACCAGGACT 
TGCCAAGCTC TGGGCCACAG ATTACTTCGT 
GTGGTGTCCC CCTCCCTTCA GACTTCCCGT 
ACTCTGTTAC CAGATGACTT TTAGATGGGA 
GACCCAGCTG CACATTTTGA AACCCTACTC 
GAGAAATCTT CCCAAATGAA TTAATGCCAA 
CTGT ACT CAA GCACGGTGAA CTGTT CAGTT 
GCTTTCTATT AGTT CACAAC CAGTTTCTGC 
TCCCCTTGTG CCTTCAGAGC TCTGTGAATT 
AATGGAAAGT GTACCGCTAC TCGTGGCTAA 
TGTTCACTTT TTGTCCATCT CCTTAGAGAT 
TTTCTCATCA GATAGAAAGG AATATTTTTT 
AATCCAGCCA CTCACTTTTA ATGCCCTCAA 
CAACTCAGTT TTTCCTGTAA CCAGGCTCCA 
ACAAGTAGGA ACATGAGATT AGTTCCAAGC 
CATACTGTGG TAATTTAAAT TAGCACACTG 
GAAACAAATG GAACAGGACA TCGATCATGG 
AAAAGAACAG CTGGAAGACT AAGCCAACAG 
ATCAAGCTAA GCCACTTCTT TTCTATGTAA 
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3 68 01 AAAACTACTG CTAGCTGCTA 
3 6 851 TTATTTCAAA AACACAAACA 
3 6901 ATCACTGCAA CACGAGGAAA 
3 6951 CCATCTTCCC GCCTTATCAA 
3 7 001 CCTGCTTATA CTACGTACTT 
3 7 051 TTGCCT CTAG GCAATAATAT 
3 7101 GTCTAACAAA ATAGCTATCT 
3 7151 GTTTTCTTGA TAAATGATCC 
3 72 01 GGCATGCAAA CCCATACTGT 
3 7251 TATATACTAG TTGAGCCACC 
3 7301 CAAACAAACA AACCCCCCAA 
3 7351 CTCTATTTAA ATAAATGGAG 
3 7401 AGTTTCTTTT TCTCAGGATC 
3 7451 CCTGATTTGG AAACAAGAAA 
3 7501 GTCCTAGGTC ACAAATCAGA 
3 7551 GGAATGAAAA ACCTTATGAT 

37 6 01 CATCCTTGTA GTCTTGTTTT 
3 7651 CTGCAACCAT TTCAACTTCT 
3 77 01 CTTCTACATG TGTTGACTCA 
3 7 751 AG C AAAATGG CTTCACTGTA 
3 78 01 GCAACTGGTC TTGGTCAATT 
3 7851 GTCAGAATAA AGGAAACTGG 
3 7901 AGGGGACAAA TGGAATGGAG 
3 7951 AACCAGCAGA ACACAGTACA 
3 80 01 AGACAATGAA AGAAGCTGAA 
3 8 051 CAGTTCTAGC TGTTCAAGAC 
3 8101 CCATTGTGAA ACCTTTCTTC 
3 8151 TGTTCAGGTG GG CTGATTC A 
3 82 01 GTAATAAAAG GAGAGCGGTG 
3 8251 CAGAC CAT CA GAACATCAGC 
3 83 01 ATGTGAGACA GATAAAACTC 
3 83 51 CCAAATCTGG ACTTATTTTC 
3 8401 TGTGTTACTA GGCTCACTGT 
3 8451 GCTTTCATTC TTCAGTGAAT 

38 501 ACAAGCACAA AAAAGCCCAT 
3 8551 AGCCCACTGT CCTTCAGCAT 
3 8601 AACACTACTT AAAATGATCT 

3 8651 AGCGGAGTTC AGGAAGTATT 

38 701 TTCCATCGGC AGGAC CAGAC 

3 8 751 CTCTTTTTCA GATATTCTTT 

3 8 801 ACATTTCACT CTAATGCTCT 

38 851 AGAGTGTGAA AAGAAACAAT 

3 8 901 AT TC AGAAAA CATCTTTATG 

3 8951 CACGCATCCA AATGTTTCAT 

3 9001 GAAGTGACAG GAAAGAAGTT 

3 9 051 TTTCAAATCA AATACACACC 

3 9101 GAACACGAGT AAGGACAGTG 

3 9151 AACTGCTATA TAGTGTCTTA 

3 9201 CTGACTTCTT GTTTTTAGAG 

3 9251 TTCCCCGGCC TGCTAATTTT 

39301 GTCACAAAAC ATCTGAAAAA 

3 9351 TTAGTCGCTA CTGCAGTATT 

3 9401 AGTTCATGCG CTACGTTCCC 
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T AT ATTG CAT CACTGGATGT GTACAGCACG 
ATTATGTTAC TCAACTGAGT AACACCCCTT 
TCCCGCCTGT TGCTATGAAC AAACAAGAAT 
CTTGAGTTCA AGCCTTCCTG TGAAAATGGT 
GGATGACATC TGTTACTTGG ATGACAT CTA 
GTCAATGCAC ATAAGAGTAA AACTAGCACA 
GGGATCTTGC AACTACTCCC TTTGGAAAAT 
AATTTCAACA TATGCACCAC TGAATTTCAT 
CATAAAGACT GTACTTCTGG ATGTAAAGAG 
TAAAGACAAC AAGTTAACTG GCAAAACAAA 
ACAACTAGAA ATTCACTTGA CCAAAGTCAC 
GCTTCAAAGT TACCTTGAAG CTGATTCTGT 
CTCTGACTCT CCTTCCCCAT CACTGTCATT 
TAAAACGTTG AAATACACTG AGAAACTGCT 
AAG CAGGAAG TAGAAAAAAC ATCACTTCGA 
TTTAGATTTT TTCAGCTCTC TACAAGTTTA 
T CT AC ACT AT ATTCTAACCC CCCCCTCTCA 
GTACAGACCC GAGCCCTTCC TCTTAACACA 
GCCTCTAGGA AACAAAAG CA TCGTGGAAGC 
GATGCTGGCA CTTACTCCTT GTCCAGAATT 
CCATTTAGTA CTACGAAACT CTCTAGTCTT 
AAGTTAAAAG TAGAAAAAAG TAGACGAGCT 
ACGTGTAGCC TCATGTTTCC TTCTACTATA 
GCTCAAAAAA ATAAAACCCA TGAAATGAGC 
AATCAGGGGA GGTTTCAAGA GACAACTGAG 
TACCAAAAAG GGCAACCTCA C C CAGAGAC A 
TACCCTAGCA AATACAAAAG AGGCTCTGCT 
GCTCTCAGAT GTGCAAAGTG AAAAACAGAT 
CATAGGCAGG AGCAAGGCAA TAGAG CGATT 
CTATGACAGA ACCTTGGAAC CCCTCATCAA 
AGACCACAGT AATCATCCAA ACCAAAC CAA 
TAGTCATTAA GTATTTTCAT GCAGAAGAAT 
CATCGAAACA AAAAGTATTA GTGTAAAACA 
GTCCTACAGA AGCATTGAAA GATGTAG CAA 
AATATTAACT CACATTATTT TTCCTTTTTA 
TAGTAGTTAC CTGAAGCGAA GCACTTCAAA 
CTGTTGAGAT CTAAGTTGAA T CTTAGAAT A 

R gene exon 2 
TTGCTTTACC TTCTCCCAAA ACATACCCTT 
TCTTTAACTG GCTTTGGTGC AGTTTTTTCT 
CAGTTTTTCT GCTCTGTCCA AGTATTCCGC 
GTTTTGCTTT ATCACCCTGT GTTTCATCTA 
GCGTTGTTAA CAACAAAACA CACGTG CAT C 
TGTTATCAAG ATACCTCTCT CAGGGCTCAC 
TTACTTATTT TTTCCCCTAT GCCATGGAAA 
AACGCCTACA AATCAATGGT AAGTAAT CAC 
TGAACGTTGC TTTGCCTTAA AAACTTGCCT 
GCACTGGAAG CTTTTTCTGT CAGTCTCTCA 
ACTACTTTTC T AAACTAAG C CATTGAGAGG 
ACCTTTTTTT AATCTAAGAC CACTTTATTT 
GAAAGTTGTG CACATCAAAG GAAGAAAAAA 
ATGAGGAGTG GTCCAACAGC CACAGTTCTG 
CCAGATCAGC AATCAAGCTT GAAAATATTA 
AAAAGTC C AT CAGTATGGTT AAAAGCATAG 
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3 9451 GGAAGTAAGT GGCATGAGTT AATGAGCACA AAACAACCTG TGGATACTAC 
3 9501 TAAGAGTTCT TACAAGAAGG GAGCAGGCAT GCAATATG C A ACTTTTGTCC 
39551 TTGCTATAAT ATAACACCTC AGCCAAACTA C AG AG AG C AA GTGTCAACTG 
3 9 601 ACAACAACAG T CAGAAGTT A AACGTTGATG TCGACAGAGG AGACTACTCC 
39651 GGGCAATATA AAACTTGACT TCATCACCCC ATGCATTACA CTTACATTTA 

R gene exon 1 

397 01 ACAACGTGGA TTAAATACTG CACAGCATGC TGGTACAAAC GGAAGGCTTC 
39751 TT CAT AGTTT CCTGCTTTAT CTTCTTGTGC TGCCTTACTA GCGAGGTCTA 

398 01 TCGCTTTCTG TAACATAGGT AAATAATTCA AATGAGTGTT GTGTGAGTGC 

39 851 TTTGTGCGAT CAAAGAGGTT TTTAAGCTGC TGCTCTGACC GCTTCTTGGT 
3 9 901 GGCCAGCTTT TCTGCTCCTT GATGTTTACC CAAAAGAGCT GCTGTTATTG 

3 9 951 AAGACTTGCT GTCAGTTGTC TTCATCAAAT CCCATCGGCA TCAGTGTTGA 

4 0 001 TACTGGAAGT ACACGATTAC AAAG C AATGA AAGCAGCACC CTTTCCCTTC 

40 051 TGACCCAGTG CCAGGAGTTG GTTTCAAAGA CTCATTATTT GGTAAGCTTC 
4 0101 TCATGAAGGC TTTAGGTACT TGACGTACAG AAGTGAGAAA TTCTAACCAT 
4 0151 CTCTTCAGTG TGCATATGGG GGGGAGCTCA GTGGACAGGA AACATACCTA 
4 0201 AATTATCACA GAAGTT CTAT CAAGGACAAT TTAGAGATGG ATTTTTATTT 
4 0251 GTTTGTTGAG ATAATTTCAA ATACATCTGG TCGTAATCTA AGACACTACA 
4 03 01 TCGGCCTGTA GATATATTGA TATTACTGTT ATTCCTTTGA TCCCGAGTGC 
4 03 51 TTTTTATTAC ATTTCAGATT ACATTACAGA TTTTTATTAC ATCCTTGGAA 
40401 CATCCGTACT GCTTCAGGAC AATTAAGAAT GACAATTCCA ATGACTAAGG 
4 0451 CACGTATGCT TAAAAAAGCC AGAGTTGACT AACGCTACCT CGAACTTCTA 
4 0501 CAGCCCTGTC TGCATATTTC CACCTTCTGC CAGTTTATTT CCCAAAGGCA 
4 0 551 GGGACAGCGT GCTCGTGATG ACTGTGCTAA CATCAGGGAG CAAGGTGAAG 
406 01 ATATTCAACC TCATCACAGG GTTTTCACTA CACACTGCTG TGCACATACT 
4 0 651 CTCAACAGTA ACCAGACGCT CTGATGCATC TCAGTCAAAA CCGAGCAGAT 
40 701 AAACTGCAGC CATCAGAGAA GGAGGAACAA CATTTCTCCT TCTATTGTTT 
4 0 751 TGTCTTGCCT TTTTGGAAGT AGAGATCACC TCATTGGATC CAT CTGAAAT 
40801 CAAGAGTAAT TTATTTCAAA ACAATCACCT GACAAGTAAG ACTATGGATC 
4 0851 CTTTGTGACA AGTGTTGAAA ACAGAGCAAC CATCTGTTTC TTTGAAACAG 
4 09 01 AACTTGGTCT TTCCTCACTG CTGACCTCGT GCTGCCCTCT ACAAATTCAT 
4 0951 TGTAGAGGGC AAACCATTCA AATTCAGCAC AACAAAAATA AATTCCAAGC 
410 01 AATAATTTCT GTTACTTTAG TGATTTAATT AC CACAGGAA CAGTC CAATG 
41051 ATTCCTGGAT GCAGAACAAC AAAAACAGGG CTATGACAAA AATGACAATA 

CR1 -GG 

41101 TATCCAAACA ACAAATA AGA GTTGGACTTG ATGATCCCTG TGTATCACTT 
41151 CCAACCCAGG ACATTCTATG ATGCTAT GGC TCTGTGTTCT AAATGGCAAA 
412 01 GACCGCCTCT GTTCAATGGT AACTCTCTTA AC AGGG CAT C TTAGAGCCCT 

412 51 GCTCCTCTGA AATACAAAAA CAAAGGTCTA CATCCTGTGC TGACTGTTTT 

413 01 TGGTATTTTT TCAAATAAAA ACCCAGAAAA CCATCACTTC GGTTTTAGAC 

413 51 TCTCAGCTCT GGTACTTTAT TACATTAGGA AGGCTCTTAG CCTGCTACTG 

414 01 CAATGAAAAA CACCAGTAAC AAACAGGAAA TAATTTATGA AAGTTGTATG 
41451 AAATAAGGCA TAGCTGTAAC CATAAATGAG GCACAACCTG TATCTATGGG 

415 01 GCTATAGTTT GAGAGCTGGA TGAACAC CAC CCTCAGAACG AACATCGGCT 

CR1-GG 

41551 TTGCTCTTCT GCTTACTCTG GGCCCTCTGA TTTC ACAGAA GGGCGCAGGT 
41601 TGGAAGGGAC CGTAAAGCCC ATTCAGTTCC AATCCCCCTT GCATGGTCAG 
41651 GGCTACACCC CACCAGCTCA GTCCGCCCAG GGCCCCATCC AACCTGGCCT 

MAR (0.72) 

41701 TGAGCACCTT CAAGGATGGG GCACCACAGC TTCTCTGGGC AGCTGTGCCA 
41751 GGGCCTCGCA ACCCTCTCTG AGTAAGGAGT TTCTTCCTAA CATCTAACTT 
418 01 TAATCTCCCC TCTTTTGGTT TAAAACCATA CCCCTTTGCC CTACCTCTAT 
41851 C AG AC CAT GT AAAAAGTCAG ACTCCCTCCT GTTTATAAGC TCCCATCAAG 
41901 TACTGGAAGG CTGCAGCAAG ATCTCTCCCA GCTTGGTCAC TATAAGCACT 
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41951 ACATAGCCTT AAGCTTACAG GCATGGACAT GGTTTAAATA GGTTTAAAAC 
42 0 01 TACTTTTTGC ACAGATTATT CCTGGATCTA TTTTGAACCG GCAACACAAG 
42 051 CAGTTCACTC CCACAACCGA AGGCTAAAAT AAAATAAAAT AAAATAAATA 
42101 ATAATTAAAA AAAAAAAAAA AAGGAATAGA GAGCAGACAA GCATTTCCAA 
42151 GAGTCGTACT CTCAGCAGAA ACCCAGTCCA AACTACGCCT CCAGCTCACA 

CpG island 



42201 


GCAGGCCGCA 


GTCTTGCCTC 


AGAGGCCAAC 


GGGTCTTCTG 


GTCCCAGCCG 


42251 


GGCAGGTGAC 


TACCCGGGGT 


CCTCCGGCGC 


CTCCGAGCCC 


CCACCCAGGC 


42301 


CTGCTCGACG 


CCCCACCGCT 


GGTGTCAGCG 


CTTCTGCCCC 


CAGGCCCAGC 


42351 


CTGGCGCCCC 


ACCCCGCCGA 


GCCCGCCCTC 


CCACCCGCCG 


GCTGCAGCGC 


42401 


ACCGGGGTTC 


AACAGGACCC 


GCTCTACCTG 


CAAGTTGCCC 


GACATGGCGG 


42451 


GGAGCCGGGA 


AGGGGAAGGA 


CACGAGACGA 


CACTGGCTAC 


GGCCGACCGG 


42501 


AGCTGCCCTT 


CCCGCCACCG 


CCGCCCACCG 


AACCGAAAAG 


CCGGCCTTCG 


42551 


CTAGCCGCTT 


CCGCACCTCA 


GCGCCGGCCG 


GCCCGCTTCC 


GCTTCCGGGC 


42601 


AGCGCCCCGT 


ACGCGTCACT 


TGACGTCAGC 


ACGCCGCGCC 


TCGCCCCGCC 


42651 


CTAT CCGAGG 


GGCTGAGCGC 


ATGCGGGCCG 


GGCGCCGGAA 


GCGGAAGTTC 



427 01 GTGGGTTGGC GCGCAGCAGT GGTGCTGAGG GAATGGGGGT GGT GTTAGGT 
42 751 CCAGCACTGA CGTAGGGGAT AGGGCTGAGA TCTGATCATG ACCTACTGTG 

428 01 GGGAGCCTGC TGTAG CAGAG GTTGGGCTGG ATGCTCTCCA GATGTCCCTT 
42 851 CCAGTCCCTG CGATTCTATG ATCATTTCTG TAAAATGTTA AATAGT C ACT 
42 901 TATAGGGTTT TGAATAAATC ACGTTTTTTC CTCATGCCTC ACGTTTGGGA 

42 951 CACAAAGACA TTTTTTTCTT ACATCTCTTC TTTCTCGTAC CATTTGCTTG 

43 0 01 CTTTCAGCGG CACTGTCTTT TGCATAATCT GAGTGCAGAA TGCTTTTTAT 
43 051 TCACAGAACC AGCTCTTAAT AATTCCTGAC AGTCATAAGC AGTCAGGCGT 
43101 TAGTCACCTG CAGCTCAGTA ATGAAACTCA ACTAACAGGT CTGCAGAGTA 
43151 AGAGCAATGA CGTGACTCAG AAAGC ACAG C ACATTGTAAA CAACTCTTGT 
43 2 01 AAACTTGCTA TATGGGTTTC AGACTAATGA ACTTCTGCTA AGTCGGTGCA 
43 251 ACAGTTGTGT TAAATTACTG T CAT AT C CTT CCCTATGTTA TTGTAATACT 
4 33 01 GTTGAGGAAA TGCTTCCTTA GATTCACAAT CCTCGTTTTT CTACCTGCCT 
43 3 51 CCAACTAAGC CCAGTACAGT CTGCTCTGGG ATGAAGGTAA AAGGCACAAG 
434 01 CACAGTCAGC CCTATATCTA GGAAGGTTGA TGTAATTTCT TCCTAAAGTC 
434 51 CTCTGCTTGG CAGCTTGTTT TGCTTAATGT CTT CATATGT GCACACCAGG 

S gene exon 1 

43501 CAGGATGCTG AAGGCTCGTT GTTTGGGGAT GATCAGTAAC AGCTGTTCTT 
43 551 CTATTGCAAA TGTGAAAGGG TACAATGTAG CAAAAATTCC TGGATGTAAT 
43601 CAGGCTCTGG GAAATGAGAA GGCAAAGGAA ATGTTGGAGG TAAGAGCAGC 
43 651 GTTCAGGAAC CAGAATGATA TGGGTTGGAA GGGATCTTAA AGATCATAGA 
43 7 01 AT C AT AG AAT CGCTAAGGTT GGAAAAGACC CACAGGATCA TCCAGTCCAA 
43 751 CCATTCACCC AT CAC CAATG GTTCTCACTA AACCATGTCC CTCAACACAA 
43 8 01 CATCCAAATG TTCTTTGAAC ACCTCCAGGG TCGGTGATTC CACCACCTCT 

S gene exon 2 

43 851 CTGGGCAGCC CATTCCAGTG CCTGACCACC CTTTCAGAGA AGTAGTATTT 

43 901 CCTAAAGTCC AGCCTGAACC TTCCCTGGCG CAGCTTGAAG CCATTCCCTC 
43951 TAGTCCTACC ACTAGTCACA CGAGAGACGA GGCCGACCCC CAGCTCACTA 

44 0 01 CAACCTCCCT TCAGGTAGTT ATAGAGAGCA ATAAGGTCTC CCCTGAGCCT 

S gene exon 3 

44 051 CCTCTTCTCT AGACTGAACA ATCCCAGCTC CTTCAGCCGC TCCTCATAAG 
44101 GTCTGTGCTT CAGACCCTTC TCCAACTTTG TTGCCCTCCT CTGGACACGC 
44151 ACCAGGCTCT CGATGTCTTT CTTACAGTGA GGGGCCTAAA ACTGGACACA 

442 01 GTACTTGAGG TGCAGCCTCA CCAGTGCTGC GTAGAGGGGG AGTCATCTTG 
44251 TTCCAACCCT GTTTTCCTGT AGGTAGTATT TCTGGCTGTG CCATCTGTAC 

443 01 CTATGGTTTT CAAATCTGTA ATGCTACACC TAGCTTTTAG AC CTAGGTCT 
443 51 AAAACAGTAC ACAAGTCACA GGCATGTTAG TAATGCCTCT CCAGTCACAC 
44401 TTTGCAGTCT TCCGAAACTC CACATATAGA CATGTTTCTA TGATTGTGAA 
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44451 TGAGATTAAA AAAAAAATAA ATTAATAAAT CAGAAAAGGC ACGTGTATAT 
44 5 01 TTACAGATAA CAGGCTAAAT ATTATACTTC TTAATTAAGC TTTACTATAC 
44 551 AGTATTCCTG TTATGTGACT TTGCAGCTAG TTTTGCCTAA GGAAATACTG 
44 601 GCTGAATGCT GAGTAATAAC ATCACGACAG ACTCCTGAGG AG CT AATGAA 
44 651 GTATTACACC AAGAGTGTAG CTTCAGTTTG AGAGACGTGT ATGGTCACAT 
44 701 TTTGGAATGC TTCCCATTGC TGAGTTGCTG TGTTACAATA TTCTCAAAAT 
44 751 CCGTGTCAGT TATTGTGTT C AACTGAGTGT AATGACAATA AAATATATTA 

CRl-GG 

44 8 01 ATGACGTTAA ATGAAGATAT C AT AGAAT C A T AG AAC AT C C CAAGTTGGAA 
44 851 GAGACCCACA GGGATCACCA TGTCCAGCTC CTGGCTCCAC ACAGCACCAC 



44901 


CCAAAATTCA 


AAGTTGATGT 


CTGAGAGCGC 


TGTCCAAATG 


CTCCTTGAAC 


44951 


TCTGGCAGCT 


TGGGGCTGCC 


CTGGGCAGCC 


TGTTCCATAC 


CCACCACCCT 


45001 


CTTGTTCCCT 


CGGGCTCTGT 


CGCAGTCACA 


CAGAGCAGAG 


CTCAGCGCTG 








CR1-GG 




45051 


CCCCTCCGCT 


CCCTGCGAGG 


AGCTGCAGCC 


GCCACCAGGC 


CTCCCCTCAG 


45101 


CTCCTCTGCT 


CTGGGCTGAA 


CAGACCAAGG 


GCTCTCAGCT 


GTTCCTCATA 


45151 


CACGTTGCCC 


TCCAGATCCT 


TCCCCATCTT 


TGTGGTCCTC 


CCTTGGACAG 


45201 


TCTCTAATAG 


TCTTATGTCC 


TTATATTGTG 


GCACCCAAAC 


CTGCACCCTG 


45251 


TGCTGGAGGT 


GCAACTGCAC 


AGCACAGAGT 


AGAGAGGACA 


ACCCTTTCCT 


45301 


GCACTCGATG 


GCAGTGCTGG 


GCCTGATGTA 


C C C CAGGGTA 


TAGTTGGCCC 



45351 TTTGGGATGC TAGGGCACAA CGCTCAGTCA CATTCAACTG TCTGTCA ACA 

454 01 AGTACCTATT GGCCTGCATG AGGCCTGTCT GCTAATTGGG ACTCTATTAA 
45451 AT C AC AT C AC TGTGACACTA GGTGGCACAG GCACACATGA TCTCCATGTT 

455 01 CCTTAAGGCT GAGTGAATCA TGGAGAATGC TTCCTGCTAT CAGTTTTTGG 
45 551 CATGGAAAGA GAGGAGCCAA ACCACCGGTT GGTTCAATGC CTTGTGCCAG 
45 6 01 GAATAGGTGA ATGCATCAAT ACAATAAGTC ACGTCTACAG CACAGC CAGG 
45651 CCTCATGTCA GCAATACTGC TCCACTGTGA TAGCTGAAAG TGACTATAAA 
45 701 TGACTAACGT TAGTGTGGGA CTTTGGTGTT AGATGACGTG AGAGCCATGC 
45 751 AGTGAAAGAG AATTAGTGTG GCAGAGTATC TAACAGTGCA GGTAGATAAG 
45 801 GCAGGAAGGA TAAGTGTAAG GAAAGATAAG GAGAAAGGCA GGAAAG TAAA 
45 851 ACCTCTGTTT TTCTCTAGTT TTCTACCTGG TGAAATGATG AAGAAAGATC 

45 901 AGTTTGACAT AGGTTAACAA AAACTGTCAG TAAGAAAGGT AGGAGTTAAG 
45951 ATGCATGTTG TCCAAATCCC ACTACATTAC TTTGACCCTC TTCAGCATAT 

46 0 01 GCACAATGAG ATCACTTGCC CAAGACAGGA CCTCCAGTGG GCATGAAATC 
46 051 TGAAAATCAA TTATTTGCTA TTTGTGTTGC TTATCATTTC CAGATGAAAT 
46101 TCTACACGAG ATAATTAGAG TGATGTCCTT GAAGAT CAAC CTTTTTGTCT 
46151 AATTAAGGTA TTTGCTATAG CTTCCAGATG TATTGCTTAT CTATGATAAA 
46201 TATCCTTCCT AACTACAAGG CTT CT ATAAT AAGAGTAACG TCCTCTATAG 
4 6251 TAACCAGTAG AAAGTAGGTG GAAGCTGGGT GTTCTTAGAC AACCTGTGCC 
4 63 01 CATACATGGA CAAAGTGAGG AGGAGGACAC CTCCCTAAAT G AC CAC C AG A 
4 6351 GACCACTGAA GACCCACATG CAAGCACAGA AGATTCAGAT GTGTTGGTGT 
46401 AACCTTGTAA ACGCAGTAAT CTCGTGAATA TGTGATAGAT AGGTGTGCCT 
4 6451 TATGTATTAG ATAGGCGAGT ATTGAGAACT TTTGGTTTAT GGATGTGGAT 
4 6501 AGTGCTGTTA TCCATCTTGC AC C CTGAG C A TAAATAAAGC AATAT CTCTT 

S gene ex on 4 

46 551 CTATAGTGCC TTGTCTTTTC ATTGTATTTC AGGAGACTTT GAAACTGACA 

4 66 01 ACAGGCATGC AGCTTGGGAG TGCTCACAGT CAGTCTGGCC ACAGTGCCTT 

4 6651 CAAGCCTCCC CTGCACTGGG ATGTGGTGTG ACAAAAAGCA CAAACACTGC 

46 7 01 TTTTGTAGAA GACCCAGACC ACAGGCTGCA CTAGGGAACG TGTCTGCCTG 

46751 GAG C AC AGTG CCCTGGGGAG TGCTGCTGGT ACAGTAGTCC TGGATGAGTG 

46 8 01 GCTTCCTTCT GTAACCTTTT AATTGCACTA GAAGTAC AC C AGCATGGCAG 

4 6 851 AG AAGGG CTG GGTCCTAAGA GCCCTTCTTT CAAATTCACT CAGAACTCCA 

46901 GATGTTTAGG CAGGGTGTTG TAGCTGTAAA GTCCAGGAAG AAAAGGTTTA 

46951 AAGCTGTACT CGGCACCAGA AAGACTGGAG CCAAAATAAA GCCACATTGC 
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47001 
47051 
47101 
47151 
47201 
47251 
47301 
47351 
47401 
47451 
47501 
47551 
47601 
47651 
47701 
47751 
47801 
47851 
47901 
47951 
48001 
48051 
48101 
48151 
48201 
48251 
48301 
48351 
48401 
48451 
48501 
48551 
48601 
48651 
48701 
48751 
48801 
48851 
48901 
48951 
49001 
49051 
49101 
49151 
49201 
49251 
49301 
49351 
49401 
49451 
49501 
49551 
49601 
49651 



AC C CATGGCA 
TAGAAGTTGA 
CTACTGTGCC 
GGT TAGGCAT 
CAGCTGCAGC 
CCCTTCCAGC 
TCCCACAGCA 
TCTAGGGGAA 
GTGATATACT 
GCTCAGTGGT 
GAGAATTCAA 
AAATTAATAT 
AATGGGAAGC 
AAGACATTCA 
TACAGGAATG 
TACGTTTTGA 
TCCACTCCTG 
GATTTGGGTT 
AGT CAGTTTA 
TCACAGTGAG 
TGAAAAACAT 
GAATTTGAAG 
AGGAACAAAA 
TATCCTCAAG 
GAGCCTTTCT 
TCTGTATAGA 
GCATTTTATA 
GTCCCTTCAG 
CTGTATGGTT 
CCTCTACCCA 
GGATGTAGCA 
CAGGCTTAAC 
GAGAAGCTTC 
TATACTGTAC 
AAACCTAAAG 
ATTATGTAAG 
AATCAAAAAA 
TCTACAGCCT 
GTGTCTCACA 
TGTTGCTTAT 
AGATTT CTAG 
CAGCAAAGCA 
TTTAT CAT CA 
GGGAGGGATT 
GATGAAGCAG 
CCCTGCTGTG 
ATCAATCTGT 
AAGCTTCAGA 
AGTAAATGTG 
AAAATCATGG 
ACATGTAGCA 
CAATGGAAAT 
CTGGCAGAGG 
GCGAGACTTG 



CTATAGGCAA 
GGAGTCCTCA 
CCTACCTGAA 
CCTTCTCTCA 
AGCAGCCAAA 
CATGACTGGT 
GGTGGGAGAG 
CACTGTAAAG 
CCATGCCCCT 
CACATGCCAC 
CCTTTGATAA 
TTCCTCTTCC 
TTAATGTAGA 
CAGTCTGTGG 
GGTCTGGTAG 
GCCAAATTCC 
CTGGAAACTG 
CAGGTGTGTG 
TTCTAGAAAT 
ATTAAT CC AG 
GCTCCACTTC 
TTTTTCTTCT 
ATGAAATCTG 
GGTAAT C ACT 
TCTTCATGAA 
GAGC CAAGGA 
TATTAGGTCT 
TGATGACTTC 
GACTTACATC 
TATATGAATT 
GCATTTCTCT 
AGCCCTACCT 
CCAGCCAGTA 
CATTTGGATT 
AAGCTTTGGA 
AGTGACAAAT 
GAAATAATGG 
CTGGTATGCA 
GTCAGACATA 
CACCACATGA 
GAGTTAGGGC 
GGCTCTGTGT 
ATTCTCATCT 
GTTGAAAGGG 
GTTCCTTACA 
AGCACTCAGT 
ATTGCATCCA 
AAGTGTAAAC 
TGCAATTTTT 
AATGGTGGAG 
GTGGACAGTG 
GACTGTGACC 
CCAGCGTGCA 
GAGCAGTGAG 
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AGGGTAGCCT 
GACTCTGTGT 
GCCTGAATCA 
AAGCTCTTGG 
GGAAAATTAG 
CCCTTCTCAC 
ACAGAGGCTT 
TCACAGGGGA 
GGCGTTCTTC 
CAAAGTCTGC 
ATCATCTGCT 
ATCCCCTACT 
CTTGGGTCTT 
CAGATGGTAG 
TACCTCTTTG 
AT C AGATTT C 
AAAATATGGA 
TATGCAGGAA 
GGGATGACTC 
CCAATTGGGA 
TATGTATTTT 
ACTGTTACAT 
AAATAATATT 
GTCCTAAACA 
CTCATCAGAA 
CAAATATTTC 
TGCTAGTGAA 
ATTCATGATT 
ATCCAATTTT 
CAGCGAGGGA 
TTGAAATGTC 
GCTTCAGAGA 
TTTCATCAAG 
GCTGCATGTT 
GCAACTCCTA 
CACTAC C AGT 
ATCTCACAAA 
AGGGTTAAAG 
AAT CTGTAAG 
GCAATAAGTA 
TGAAAGCATC 
ACCTGTCTAG 
CCATCAACTT 
ATCTCTGGAG 
ATAGGTGGCA 
GTAAAAAGAA 
AACCAGAAGA 
AATTCACAGA 
ATTTTCCCCA 
GGTGATGGAG 
AGGTCACCGA 
TCGCTAGCCC 
CTTGTGCCTG 
CGAGTTGGTT 



TGGGGCAAGA 
CAAGGGGATG 
GTACAAATGT 
TCTTTGCACA 
GTCTTGCTTT 
TCCACATCTG 
TCTTGAAGAA 
GGAGACGTGG 
CCCTGAGTGC 
ATTCATCTTT 
TTGACAAAAT 
TTTACAGGCT 
ACTAAACCAT 
CAGTATATTT 
GAAAGGAAAA 
TTAGTTTTGT 
AATGCTTGGA 
ATGTGTTACC 
CACTTTTATA 
AAACAGCCTT 
TTAATATACT 
GCATTCCAAC 
TTTCTTAGCT 
ACATACTTAT 
AGGGACACTC 
GCCTACGTCT 
TTATGACTGA 
GAATAAATGT 
GCCATCTGCA 
TTTTGTACTA 
TTTACAGATG 
CACTGCTGTA 
TTAAAAAAAA 
GACATCATTT 
AGTGTATGGT 
CTTCCAAAAA 
ACTGGATCTG 
TAGAGTGATT 
CAGGTCCAGG 
AT CTGAAC AC 
ATGAAGTTTA 
CCACATTGTC 
TTAAGAATTA 
ATCATCTAGT 
TAGGAAAGTG 
AGCCTGGAGT 
GGCAAAAAAA 
AGATGGATTA 
ATATGTCACC 
GCCTAGCCTG 
CCAAGCGGTT 
TCACTGTACA 
GACTCCCGTT 
GAGTTGTGCT 



CTTGATGTAC 
TGCCACAACT 
CTCACGCATG 
CTTTCTTCTG 
GAAAGCCAGC 
TGGATGATGC 
ACCCAGCCCC 
CTTTGAGACA 
CACTGGTGCT 
AAATGCTGCT 
CGACATTTAA 
GGCTCAAGAA 
TTCACTGGGA 
T CT CT CAT AG 
TGTAAACTCA 
TAGTTTTCAC 
AATTTACTGT 
TTCCAGAGTA 
CACTTGTAAT 
TCTTAAATTG 
TCAGCATTGT 
AGAATTTGTC 
TTGCATGTGT 
GGCTGTTTCT 
ATATTGGCAG 
TCTCTGCGTA 
ATGGAATACA 
AGCTTCAGGG 
ACAGCCAACA 
TGTGTTGCTG 
CAATG CCTAG 
AAAAGAAAAA 
TCTAAAAGTT 
AGGATT CTGA 
AGATGCTCTC 
TGCATGCTGA 
CAGATCAGGT 
GTTGTAGCTT 
TTTTGTAAAT 
CCAATGTAAC 
TTCTTTTCTA 
TCTGACAAAA 
CAGAATTGAA 
CTTACCCCAT 
TGAGCAAACA 
AGAGACCAAC 
GTGTCTCACT 
TTGTGGAGAG 
ATTACAAAGG 
GGGCCCCAAT 
GTGATGT CAG 
GATTTGGGAT 
GAGCATAGCT 
GTGGGGCTGC 



WO 2004/094640 



PCT/US2003/039244 



49701 TGGCAGCAGT TCTTGGTGCC 
49751 GCCCTGCCTG CCTCAGGCAG 

498 01 CAGAGGTGAG TGGTACAGTG 

49 851 TGCTGAGGGA C C AGGAC AT C 

499 01 TGACCATGGC CTCTCTCTCT 
49951 GCAGCACCTG TGAGGGG AG C 

50 0 01 TGCACAGCCC GGGGCCATGG 
5 0 051 GCGGGCAAAG TCGGGAGGAT 
50101 CTTTGCTATG GCTGTGCCAT 
50151 CATATGTGGG CACCAAATCA 
502 01 ACGATTACCT CGAGTGTGCT 

502 51 GGCCTGCAGG ACGAGCAGGG 
50 3 01 GCACAACTTC CCCGCCGAGG 

503 51 AAGACCTCGA GCCCCTGCTC 
50401 ACCAGAAATG GGTGGTGGGC 
50451 CCTCCTTTGT ATTTTCGGAC 
50 5 01 AGCCATTCTC TGATGCTGAC 
5 0 551 ACCGCTGCAG CCCTGTACGG 
50 601 GGAAGGCTGT GCTGCAGGAC 
50651 GCACCACCAC CTCCCATCGG 
50 7 01 ACCAGTCCCG CAGGGCCCAG 
5 0 751 GGGAGCTGGG CACCCCAGCA 
5 08 01 AGGAGGAGCC ATGGCAGGCG 
50 851 GATCGCTCGT GTGGGGGGCC 

509 01 CAGCCCCCAG GCCTCACCCC 
5 0 951 GCTAGGCCGG CACCGCACTG 

510 01 GGCCAACATC TACCTCTCGG 
510 51 CAAGAAAGAA TATTTAGAGC 
51101 GAAAGGAATG CTGAGGACAG 
51151 TGAGCTGCAT GATCGCTGCC 

512 01 GC T AGGAATG ACTCATAACT 
51251 AAGGAGTAGA ACAAGAG CAA 

513 01 GTAAAGGGGG TGGGAAAGAT 
51351 GATATTAGAA TATGCAATAA 

514 01 GTGCCTTGGA TGCTGCAAGA 

514 51 AAAAGGATAA GAAAATGTCT 

515 01 TGTCCTTTCC CATGCTGCTT 
51551 AGACATGCAG CTTTATTTCC 
51601 TAAAAAAACA CACGAGGGCT 
51651 TCTAATAATA ATCCAATAAT 

517 01 CAGCTCCAGG C C AC AGATAA 
51751 TGTGATGTCA CCCTCTCTGT 

518 01 C C AAAAAT AT GTGCTGTAAT 
51851 GGTACTTTGG GTTCTACAGT 

519 01 TTGCTGTAGC TTGTGCACGA 
51951 CCCAGATTTT CTTTCTGGAA 
52 001 TGTTTTTTGC TCTCATCAGA 
52 051 GACAGTGAGG AGCAGAAAAG 
52101 CATTACCAGA TTCTTCTAGA 
52151 CATTACACAA TAGAATAACT 
52201 AGTAAGTTTT GAAACTGACT 
52251 CTCCATTCTC ACAGTAGATT 
523 01 GTAGAGAGTA GTGTTCACAA 
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CACCCCACAG TACCACCAGC GTTTCCCCCA 
CTGGGGCCAC ACAGGGTGCA CTTGTAGCAG 
GGGAAGTGGT GGGGAAGTGG GAGGGTTTGC 
TGGACAGCTG CCTGCCCATG GGACAGCGAG 
CTTTGCAGTT CGTAACACCT TCTGCCTGCT 
AGTTTCCTGA CCTCAGCTCT CCCAGCCCAC 
T gene 

ACGTGCCGTC CAACTGGACC TGCCCCATCT 
GTCACCTATG TGACCCCCTG CCAACACCAG 
CTGGTGGGCA GAGAAGAAGC CGAGTTGTGC 
CCACTATCCG ATACTCGGTG AGGTCGGATG 
GTCCCGCAGC CCGCAGCACG CTCAGATCAC 
GCCTGCAGAG CCGGTGCTCA TCCCACCTGA 
TCTGGGCTGC ATTTTTTGAT GGACATCCCG 
CACTGGCTGC AGGATGAGAT CCAGCAGTTG 
AGTGTGTGTT GGACAGTGGA CTGTTGTAGG 
TGGACGAGGA GGCCTTGGTG CAGGAGCTGC 
TTGGTGCCCT TTGTAAGGCG GCTCATCAGC 
CCCAGTGATC CGCCGCCAGC TCGACCAGCA 
AGCGGGAGGA CAGCCCCGCA GCCAGCCCCA 
GAGCCTCCTG CCTTGCGCCC AGGCCGCTCC 
CACCGAGGAG CTGCCCGGCA GCTCTACTGG 
CCACCACCGC GCCCTCAGTG GAGGAGCCGC 
GTGGCAGCGG GCCCCTCCAC CCAGGGCAGG 
CCGGCGCCCC CCGAAGAGGA AGGCCCACAG 
CGCCCCCCAA AAGGCGGCCC CGACGGCGGC 
CCGTCAGAGC ACGGCTCCAG TGGGCTGGGA 
CCTGCTGCTT G C AG AT AAAA TGTGGGGATT 
ACAAGCTGCA GAACAAGATA AACAGCATGG 
AGGATGCCTC CAAGAGAGAA GAAAGTCAAG 
TAACAATCCT AATTGGAAGA AGAGTATGTG 
CTGATTGGAG AAGCGCCTGC ATGCGTGGTT 
GGGTGACCCT GTGGGATGTT TTGTTGACAT 
AC C AG AG AAA ACTTGGCAGT GTATTTAAGG 
ATGATTTGGA TTGCTCATAC ATCTGAGTCC 
AAATAAACAG AAATTCAAAA AAAAAAAAAA 
CTGTGTTATT GACAAGGCTG TGGGCGTTGC 
TCTCCCTCTT TTTCTCCTGG AGGTGAGCAC 
ATGACCATAA ATTGGCTTTC ATGACAGCAC 
CCAACAAACA GAGAAAGGAA CTTATGTTAC 
CAGGGCTTCA CTAATTTCCT CTCATACTGC 
TTAAGTTTTG TTTGATTTCA GTGACTGAGC 
AGACTTCCTA TTAGTCTGAT GTTAAAAACA 
CCAAAGAGAA ATTATGGGTC CCATTAAATT 
CTCTGTTATG C AAGAGTT C A AGCTAAATGA 
GTTTTGAAAA GATACCAATC TGTGAACAGA 
TTCTCCTCCC CTGTGCAAAG GAAAGCACAT 
GAGTACTCTG AAATGAACAT TTTTGAGTTA 
AAATTCTATT CACATAGGTG CTTTTAAAAG 
CAAATGACAG AGGAATAACT TTTGCCATTC 
GAACTGCAAA AC AAAGAGT C ACGCTACAGG 
TGCTTACCTC TGATGCTTCC AGCTGACTTT 
CAAAGTTCTT TTTTTTTTAA CTGTGTGACT 
CTTAACTGCA TGCTGTGCAG TCTGAATTAG 
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52351 
52401 
52451 
52501 
52551 
52601 
52651 
52701 
52751 
52801 
52851 
52901 
52951 
53001 
53051 
53101 
53151 
53201 
53251 
53301 
53351 
53401 
53451 
53501 
53551 
53601 
53651 
53701 
53751 
53801 
53851 
53901 
53951 
54001 
54051 
54101 
54151 
54201 
54251 
54301 
54351 
54401 
54451 
54501 
54551 
54601 
54651 
54701 
54751 
54801 
54851 
54901 
54951 
55001 



AG CTGGGGGT 
ACTGTGGTTT 
AGTTTAAACA 
CAGAACTCTT 
CATCCAAGAT 
GGTACAGTGG 
TGGACCTGTC 
GTGGCTGGAC 
ACCAGATTTC 
CTAAGCCTTT 
AAGTCATTTA 
CAT CTTAAGC 
GTCTATAATT 
TGT CT C ACTA 
TGGTAGCCAT 
CGGAAGGGTA 
TCCAAGTCAC 
ATTAACTCGC 
GCTTAATGGG 
TAGCTTACAT 
TTCTGTTATC 
TTTATCATTC 
TTCTGTGCTT 
AGCTGAATCA 
CTAATCAGTT 
ACATCTGGAT 
GTGAGACAGC 
GTACACTGAT 
AGCAAGAATT 
AAAAGCTCCA 
TGGAGGCAGA 
CTTTTATTGT 
AGAACTCTGG 
TCATAACTAC 
GGATAAAAGC 
CTGCACATAG 
AAGGGAGGCT 
CCATGTAATC 
TTCAGAGTTA 
AAGCACTTCT 
TTACGTGAGC 
CCTGATAATT 
TATAATCAAG 
AATGTACTGC 
AACAGTACTA 
AGGTTTGAAA 
TAAGCATGTT 
AGCCCTAGTC 
CCAGAGAGGA 
CTGAACCACA 
GGACTGGATG 
CATCTGATTT 
GTTGGGAGCA 
CCAGCACTTT 



AGGTGATAAC 
GTCTCATTAT 
TGCCTTCTGC 
ACTGTTCTTG 
GTCCTCCAAA 
TGTTGTACCA 
GTGCAAGAAT 
CAGGAGTAGC 
CAGGTTAGCA 
TAATTAGTTC 
TCTTTCTCTT 
AGGAAGCTCT 
CTTCTACACG 
ACAAGCCTGT 
CTTTGCAGTG 
TTTGCACTGT 
TTATGC CTGT 
TGGATTATGC 
ACCATTAGAT 
ATTACCTGGG 
AATTCCTAGC 
ACAATTTTTC 
TGTTACACAA 
GCTGCTGAAT 
TACATTAGCT 
GTCAAGATGG 
GACACTGTAG 
TTTGTTGCCT 
AACATGTTTG 
TCCTCTCCTG 
CCCATTTCCA 
CCTTTAATAT 
GAAGGTGTGT 
CTGGATTACA 
AAGAGGGGAA 
TAAGGAGGCA 
GTGTGT C AGG 
GCTGGACATG 
GAGGGAGAGG 
GCCAATGTGA 
AATCCTATCC 
CGAAAAT CAC 
CACAGTATGG 
AGTGGTAATA 
GAGAGAAGCA 
GTATCCCTGA 
TGGGTTTTTT 
ACAGCCTGGC 
AGGAAGAACA 
GTGCACATGT 
CTTCAGTGGG 
ATCTGTGCCA 
ATGTCTTTAG 
CATTTCATCT 
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ACACCTCCTT 
TTTCTTCTAA 
CTGCCTTAGA 
AATTCATGGG 
CAATACGTTC 
GGGGGTGAGC 
AGAAAGAAGT 
CAACTATGTG 
TTTTCCTCTT 
TTGTATGGAA 
CATCCTAGCC 
TCAAGCCAGG 
TATGACATTT 
TTCTGTTTTT 
GGCTTTGGAT 
CAC AAGTTC C 
TATGTAAATG 
ATGAAGTATC 
CT C AGAAGAA 
TAGATGTAAT 
ACTTTCTTTT 
CTACTTTTCC 
AT CT AGGG AT 
GTAGGAATAG 
GAGGTTCAGG 
GTTTGCAGGT 
GGTGTTTTAC 
TTAGCAGATC 
TTCCTCGTCT 
AAAATACACA 
GTGTCTATTT 
GCATATATTC 
GCCCTCACCC 
GGAGAAAGTG 
AGAAATCCCC 
CTCAAGGGCA 
GCCCAGGCCT 
TCAGTTCCTA 
TCTGCTTGGT 
TCTACTCTCT 
CCATGCAGGC 
ACCTACACTC 
TGTGGCTATG 
TGCTTTTAAT 
CACCAGGCCC 
ATTAAAATAA 
TCTGAGACAA 
AAAAGGAACC 
GTGTTGTCCC 
GTGGGGGTTC 
AAGTGATTAA 
TACCAAGGCC 
GGAAAAGGTC 
TTCATTTCCT 



CAACTGTTTT 
ATGCTATTTT 
ACTGCAGAAG 
AAGGT CTGAG 
CCTCATTCCA 
ACTGCAGTGG 
CCCACAACAG 
GCTGCTGTGA 
AGACCCATCC 
AGTAGCAGAA 
ACTCTTACCA 
CTATTATTCC 
TGTCTACATC 
TATCCACAAC 
CTTGACCCAA 
TATAGACCTA 
TTAACGCTAT 
CTCTGGAGTT 
TGACGAAAGC 
GGGAAAGAGA 
GTTAAATATA 
TTTTTTTATG 
CCTGGGTTAG 
CTCACTTGCT 
GCCATTGTTT 
ATAACTTTTA 
TTCGAGTAAT 
TGCCAAATAC 
TAGTTGCATT 
GCTGGAGAAA 
CAGCAAATAT 
ATGCTGTGAA 
ACATTAATCA 
ACT CAT CT AC 
AGTTCTCACA 
TAATGCAAAC 
GCTGCTGTGG 
CTGCTCCGAT 
CTCTCTGCTA 
CTTCAAAGTT 
TTTCTTTTGG 
AGCTAGGAGC 
T AG AGGAAG C 
AAGGCAACTC 
TGAAACTTCA 
TTGAGGAAAG 
GCATCGTGTA 
TGGTGCAGTC 
TGTCCTTGTG 
ACAATACTGT 
TCCAAAGCAC 
AGTTATGCCC 
AGATG C C AAA 
GTTCTCCAGG 



GTTTTCCTGA 
AAGCAGTAAG 
ACCTTAAATG 
GAAATGGGGC 
TTATGT GTAA 
TAAGTGCTGT 
CCAAAGTCCA 
TTTGATCCAC 
TTATTAATCC 
ACTGTATAGG 
GAGTAATTTT 
ATCATAAACT 
TTCCAATATC 
CCATCGAATT 
GAAAGGAAAA 
ATTGCAGCTT 
TGTGGAGTTT 
TCCCCATCAA 
TATTTCTCAG 
AAAAGAAGCA 
GGCTATTTTT 
GCCTAGTATG 
TGGTGATATG 
TTCATGGGTG 
GTTAAGATTT 
TAAGTGACTG 
GCAGAAGAAT 
CAACTGAAGA 
CAGGGACAAG 
ATTCAGACCA 
TGACT CTAAG 
TCTATGCTGA 
CCCAGACACT 
TGATGACGCT 
CCTCCCTCCT 
CCAGAGCTGG 
GCAGCAAAGG 
CTGAAAC CAG 
CT C ATGGGAA 
TCCACATTGC 
TAGGTGAGCC 
TAACCAGATC 
AT CTT CAATA 
TGTTCACATG 
GGG CAAACAA 
GTGACAAACC 
GGTTGTTTTG 
ACTATGGGCA 
AGAAGGAGCT 
CTTCCTGGAA 
TGTCTCTCTG 
AGTGCTAAGA 
TGATCTGATT 
TTAAAGGCCT 



WO 2004/094640 



PCT/US2003/039244 



55 051 TGTTCTCACT GAAAGCTGGC 
55101 GTTTTATCAT TATTATTATT 
55151 ACAGGCTCAA TAAAATAAAT 
55201 AAATATT C AC TGGTGAGGCA 
55251 AGGCTGAGTT CTTGCTGGAC 
553 01 CATCCCTCAT C CAT CT CAGT 
55351 GAGAGGTCAG CTTGCTGCTG 
55401 AGAATGACAC TTGATTATTC 
55451 TGGTTTGTAA CTGTCAGTCT 
555 01 TCTTGCCTGT GACTTCTCTC 

555 51 CAGTGAATAC AGAAAGTTCA 

556 01 TTTTACCCTT TGTTTTTTCT 
55651 GAAATAAAGG CATGCAAATA 
55 701 GCTTGCTGTA AAAGCCGTAG 
55 751 CTGGCAGCAT AATAAACATT 

55 8 01 CACTTGTAGC CAATTT CCTG 
55851 AGACATGTCC TGAGTGTCAA 
559 01 GACAACAGCC GTCAGTTGGA 
55951 ACTGTGTTGA GCAGCAGGAG 

56 0 01 GGGTGGGAGG AAGGAAGCTG 
56 051 ACAACGCCCT GAAGGAGTGG 
56101 AGACAGGATG ACAGGAGTGT 
56151 GCATGGCCAG GTGGTGGCAC 

562 01 GCAAGGCCTT CGCTTGTGTT 
56251 TTGGTGACCT GGAAAAGGAA 

563 01 GACTGTGGCT TGTCCTTGTT 

563 51 TTTCACTTTG CACATCCCTT 

564 01 T C AGAGAC AG CATCATGACA 
564 51 ACAGCTTTGA TTTTATTATT 
56 501 AAAATCCTGC TCTTCATTCT 
56 551 TTGCAATAGA TGTGTGTTGA 
56 6 01 AATTCGGTGA CAGGTAAATG 
56651 AGATGT C ATT CAGTCAGACT 
56 701 GTGCTTGTCC CTTGCAAACC 
56 751 TTTCAAGTCA GCATCTTTTG 
568 01 GTAAGATCCA GGTAAGTGAC 
56 851 ACATATAACA TATACATAAA 

56 9 01 GTGACAC CTA ATATTATTTC 
56951 AATATCCATT CTCTGGGGAA 

57 0 01 AGGAGCCCGT GTGGACTCCT 
57 051 CTCATGGCAC TATGATCAGA 
57101 AGGCAGAAGA GTTCTGTAAG 
57151 GAGAAGTTAC AGAGTGGAGC 

572 01 AATGTAGAGT TCTTTCACCT 
57251 GCACCCCCTT GTATTTGTAT 

573 01 TGGTGATAGG AGGTGAAGGT 
57351 TGAGTTGTCA GCCACTCCCC 

574 01 CTTCACAGAG GGCCGTCTTT 
574 51 CCTTTGGCTC TAGATTACCT 
57 5 01 CATTATTGCA GATTTTTATC 
57551 AGATATGTGA CAATAATTGC 
57 601 GATGTTCTTT GAATTGCTTA 

576 51 ATTCACTTAT GGTAAAG CAT 

577 01 AATGAAGTTG AACCCTACTT 
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AACTGTTTGG CCGCCTGTTA TTTCAGAGTT 
TTCCTGATGA GATGTATATC CCAAACAAGA 
GAATGAAATT AATTTCCTGT CTTTGATTAG 
CACTTCTACG TCAGCAGACA TGTCTGCAGA 
GTGTTGAAGC AGTGTGTTGC TGTGTGACAC 
GCAGATG C CT TGGGAAGAAA GAGGAAAAAA 
CTCAGCTTGT GTCTCTCTCA GTATAAT CCT 
TAAGTGCTAT TGTAGTTGCA AAT CAT CGTG 
GACCTTTACT AGACATATAC TGGAAAATAT 
TATTGCTAAA TAATGATCTA GACAGATACA 
GTTGTATAGA CCAACTGACA GACATTGTGA 
AAGTGTGCCG AGGAAGCAGG TTGTTGTATT 
ACCTGCTACT GGCTCCCTCC AAGAT CT CAG 
CTAGGTCAAA AGGGGTTGCA CCTTTTGTGA 
CCCCAGTTTA TTCTGCTCAT TATTCCATCC 
TGTGGTCTCC AAAGCATGAA ATCTGCAATC 
TGCATTAGGG AAATAAAATA AGGAAAAAAA 
GTCTGTGAAG GAGCTGAGCT GGTTCATAGA 
CGTTCCTTGC CCCAAACGAG CCTGTCCAAG 
TTTTTCTTTT CCACTCAGCA GCGTGTAGCA 
CAGGGTGAGC CGAGACCTGG GGCTGAGAGC 
TACCAGGTGG CTGCATCCCT CCTGCACACC 
ATGGGGATGG CTGCTGCGTT CTGGTGCCCA 
TCCCCTTGGG CTGTTGAAGA CCTGAAAATG 
GATGAAAGCT CCTCTGTTGC TTTGTCAGAA 
ATGAAACTTT GGACTGCAAT AATGACGGTG 
GTTCCTATGT TGTTTTGCCT CTGTCTTTTT 
GGAGATGTCC AACTCACAGG GAGCTTTAAA 
ATTATTATTA TTATTATTAT CTAGTAAAGA 
GCTATCTTTT TAACTTGATT AAAAAACAGG 
AAATTCTTGG AGGTCAAACC AAG CAAGACT 
CAGGAGATGC ACAAGCTGAT GCAGTTAGTT 
GAGGAAGATG AACTGGGAAG CCAAGCTCCA 
TCAGGTACCT AAGGCTCAGC ATCTTCTGCT 
CTTTACTGCT TCTTCCTGGT TGGGC CATAA 
AGGCACTCCC ATAAATACTA ATGTGAAGAT 
ATGACTTTAG GATGTTATTT GTCTCTATAT 
AACATTATCT CCAACTGTAA ATTAACCCCA 
GCAGGACTGC CCTGTGAGCC ACTCAGTTAC 
TTCTTTGAAG CATGGCAATT GGTTATTTCT 
AAT CAGT CAT TTCT CAGCAG CAG CTGGGT C 
CCTACAAAAT GCCAGACCTC AGCTAT CAAA 
GCAGTAATGT TCATGCACCT TCAGCTTTGT 
TTCCCTGCTC AGCTTTTAGA AGTAAC CAGT 
TTCCAGCCTA CATGGTGATG GAG CT GGTGA 
GGTGACCTGT GCTATCTCTG TCAAGGCAGT 
ACCAGTGTGC AT C C AGAAGC TCCTGAACCA 
AGTAGACTTG GGCCTGAAGG AAAGTTTGTT 
GCAGACATCT GAG AAGAGT C TGGCAGCCAA 
AAAATTTCTC T C AT AGTGT A AGGCATAATT 
CCAGATGACA CATTTTTGCC AGCTGTTCTC 
AACATATCCA CACCTTAATG TTTGTTATGA 
ACTAATTTTA TGTTCACTGT CTGGTAAATA 
CAT CACAG C A GAAAGGAAAA TAAGATTTCC 
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57 751 TTTGGGTAAT AATGAGTCTT TTGGAAGCAC ATATTGCTGG GATCTGATCA 
57 8 01 GGAAGCAGTG GTGTGATTTA CACTCAAAAC TGTCATTCCA GAAGTAGATA 

57 851 ACTCCTCAGT TGCCACTGCC ACCAAGGAGG TAT C AC AT C A GGGAAGACAC 
57901 GTACTTTGGT TTCCTAGCCC TTTCATTTGC CAGCATAGCA AT CTTGAAAG 
57951 CAAGCTCAAT AACGTTTACT TTTTTTGTAG CACATCAGTA GTCTAAGGGC 

58 0 01 ATATAGGGCT GCCTGGGGTG GGTGAGGTGT GGACAGCAGC TGTGTTTGTG 
58 051 GGTGGCAGTT TCCTCATGAA GGTTGACTTG GACTCCAGGA AGTCCTTGTT 
58101 AGTGTGGTGG GCAGCTAGAC TTCGTGATGG GAGAAG CAGT GTGGAGCAGA 
58151 GTAGGCCACC TTTTCCTTGC TAAGGACTAC ATACATTTTT CAAATCAAAT 

582 01 AAGACTGTCA GGCTGCACCA CCACCTTTCC CTCTCCCTTT CTTCAACCCC 
5 8251 CTCCGCATTG GATT CATCAG TAATCATGTG GCCAATGAGG CCTGAACCCA 

583 01 GTGGTTCCCA TTCTGGAGAT GCTGGAAAGG AGCTGTGGCT TTTTCCACCC 
5 83 51 TCCCAGCTGA AATCATCTGC TTGAGAGCAA CACGGTGAGC AGACATTTCT 

584 01 CTTTGAGCAC GAGCCCCACT CTGGCCTTGG GCTTTTGAGG CAACCAAGTA 
58451 CACCACACAA CCCTCACAAA GACAACACTG AGACACTATC TTAACGCAGA 
58 5 01 CACAGATGCA AAGTACTGGG TTGCCATCAC ACGTTTTGGT TGCCTTCCCT 
58551 GTGTGTCTTT TTCACAATTA CAATTTTTTA CGAAT CAC AG CAATTCCAAT 
58 601 GT AC CTTGGA GCACTGGTTT TGCTCAGAAG CTCCTTCAGT TGCTCCTTCA 
5 8 651 CTCCTGCACC CTCTCTTTCA GAGTCTTTGC TCCCTCCTGT CTGGTGCCTA 
58 701 TTGTAGGGCA CTTGCTGCAG GCAACGTGCC GAGGCTGCTG TTTCCTGCTG 
58751 GAGCATGTTG TTGTGTTCAG TAGCTAGGCA GGCAAAGACA ACT C AGGTGT 
58 801 GACTCATACT GCTCCTCCAG TTTAGGAGTT GCTTGAAGAA TGATTAAGGG 
58 8 51 AAAGAAAAAA AAAAAAAAAG AAGAAAAAAA AAGCGAGAGT GTGTGCGTAA 

58 901 TTAGGGCTGG GAAAGTGGGA GTGGTGTCAA TGAG CAGGCA AGCTAACAGA 
589 51 CTCTTCTGCT CAGCAAGGTT TCGCCACAGG TCTCTAGTTC TTCGTGTTGG 

59 001 ACCCATTTCA TTGATTCATT TACCAGGATT ATACCCTTGC AAAGTAAGTA 
59 051 CAACATTTTT TCTTCCTCTG GTACTTTGAT GCCTAACTAA CAGTTATAAA 
59101 ATCTCTGCCG TGAAACGTAA T CG CAT AG AG TGAGCAGGAT AAGGAAGCTT 
5 9151 GTACAAAAAG ACAG CATGTG AAA CT AAAGG TGGAAAAAAG CCTGCTCAGA 
5 9201 CTACAACCTT CTGGTTTTGT CCCAGCATTG CTGTGTTTGT CCGTCTACAT 
59251 TCCTTACCCC TGAAAAGATT CGTTGCAAAC GGTGAGTGGG GT CTGTCAGG 
593 01 TGAGACTGCG AAGACATTAG GAATTATTAG AATATTTAGA CTACCTGTTG 
5 9351 GCTGTTCCCA ATGGCTTTTC CTGAATCAGA AGAGCAGGCT GTATATGATG 
5 9401 CCATGAAAAT GCTCCATATC TCTAAGTAGG TGTAGACTGT TTGAGAAGTT 
5 9451 TACAAAAGAA TATTCTTCTT GTTTTCCACA TGGGTT CGTA TGCTTTGCTC 
59501 TGTTTGCTTT GTCTAAGCCT TGCTAGTTCA AAGGACAAGA ACTTAAGTCT 
59551 AC TAATTAC C TACTTGATCT TCAGTGTGCT CATCCGGTTG GAAAAATTCA 

5 9601 CTGACTCTTG AGGCACAATA AAGGGTATTG TGGAGACTCT CTAATTCCTG 
59651 GTGTGACTTT CTCAATTGTG TTGCTGATGG TGCTTTTTCC ACAACCTGAT 
59 701 GAACACTCTG ATCTCGCTAA AGCAAAGCAT CAGTCTGATA TTGTGTGTTC 
59751 CTCAGAGAAA CATCTGTTCA GAGGAAATCA TGTCTTAGTC ACGGAGCTGT 
59 801 GTAACCTGCC TGGTGGAGAG CTGCCATTTG TGTAGAAGTA GGAGGAAGAG 
59851 GCTCACAAGA GTTTTGTTCC TTTATATTTT GTGTTAT C C A AG CAAGAGCT 
59901 CCAGTAAGGT CATGTTAAAT GAGCTAGTTT GGAGGGGGAA TGCCCCACAT 
59951 GTGGGTTCTT TCATATCGTT TATCTAAACT GAAGTGACTG CAGGGTGTTC 

6 0001 ACTCACAGCT CATGAGCTGT AGCCCTAGTG CACAGCCCCA ATAGCAGCCC 
6 0 051 AGCTTGGATG GCCACCGCCC GGTCTGCCCC GGGTGCGCAC TCCTCAGGGC 
6 0101 TCTTTAACAA AGGCAAGAAT AAAATAAATA CTTGCTCTGC TTTAT CAGAT 
6 0151 GATGCTTACC ATTCAGCTGA CGTGACTTGT CAGGTTTCCA CACAGATGTT 
6 0 201 GCCGTTCTCC TGATTAATGT TCAGAAGATA AACTACATTT AGCTTTTCTC 
6 0251 TTAGTAAGCA AATAG CAAAC AAAGCTTTGT TTCTGTTGGT TGCATTCAGG 
6 0301 AGTGACAAAG CAAAAATAGT GT C CTATACT ACTAAACACC TTTAAGTTAT 
60351 TTTTTTTCTG CACTGATTCT AGAGCCTCTC AGCTTCCTCC TGTATCTGAA 
6 0401 CGTGTTTCTT GAACTCTGTG GCCCCATCAC AGCTTTAAGC AAAGCTGGGT 
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60451 GGATCACAGG CTGCATGTGC 
60501 CAGAATGCTG ACTTGTTGCT 
60551 CCAAGCAGCC CTTCTTGAAA 
60601 GTGTGAGAGT TTCCTGTGAT 
60651 GGACTGCTCG TGGGCACCTC 
60 701 CCCTTCTCAC TCAGCCAGAA 
60751 GGGGTGCTCA GGGCTTTAAA 
60 801 GTAAATTCCT TTCAAGCATG 
60 851 TCATTAATGA ATGCAGCATT 
60901 AAAGATGGTG AGATAGAGAT 
60951 AGCAGGGTGA AACTGAGGGT 
610 01 GCAATATATC AG TAG C AGC A 
61051 CTACCAGGAA TAGATTCTCT 
61101 CAGCTACTGA GTGAACAGGA 
61151 CCAGGTTTCC TCACTGAAGA 
612 01 GCTTGAATGG T AC C AGAAAA 

612 51 TGGTGTAAGG AAGAAGAGAG 

613 01 TTTCCTTGCT CTGAATATTT 

613 51 ACATGCACAT TTTAATATCA 

614 01 TGCTGATGTT AGAAAGGCAG 
61451 AACGTGAAAT GTTTTGAAGT 

615 01 TAAGGAAGAT TTGGAAGTCA 
61551 AGGATATTTT ACTCCGCTAG 

616 01 ACATCATTCT GGCATTGTGA 
61651 TGGTAGATGT TGAATGACAC 
61701 TCTCCCCAGT GAGAAATGCC 
61751 TTTTTACTTC AGACAGAGAA 
618 01 CTTAAA.GTGC CCTTAAAGAG 
618 51 ATCCTGTTCA GAGCAGTTTG 
61901 TGAGCGAGTG CAGAAAGGCA 
61951 TGACCTGTTG TTCCAAGTGA 
62 0 01 TTTGTTTTCA CTGTCTTTTG 
62 051 TTGATGTATG CGGAGT CCAT 
62101 TTAGTTGCTT TTATATGTGA 
62151 TGATGCTTGT AGTGCTGACT 
622 01 AAACTCTGAG AACACTTCCC 

622 51 GCTGACACTG TAAAATGGGA 

623 01 CTTGGGACAC TGCCAGTGTC 

623 51 AAAGCATCTT ACAGGGATCC 

624 01 TTTTATTTGG GGATAACTTT 
62451 TCCTCAAGAT GAATTGCTTT 
62501 TTGTTGCTGA TACCAACAGT 
62551 TAAGTGTGTT TTTCAAATTT 

626 01 ATCCTTTTTT CACTTAATTA 
62651 TACAGCGTCT TTTAGCTAAA 

627 01 AACATATACC TATGTAAGTT 
62751 AAT CAAACAG ATTTTACATG 
62801 AAAAAAAATA CCCCCCTGTT 

62 851 ACAGATTATT TTGGAAATCT 
62901 T CT CG AT AC A TTGATGTCTG 
62951 ATGCTCTTGA ATGCAGATTT 

63 001 ATGAGAGTGA GCGGTCAGAA 
63 051 TGCCCACATG TCCCCTCTTC 
63101 TCATCTGGTG AGAT CTTTAC 
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TTAGAAGGTG CCACCGTGCC GCGGGCCTCT 
CTCCTGGGAA AGCAGGGATT CAGCCAGAAT 
TTTCATTTCC AATTTTGTTG ACTCCTCCCT 
TACTGACTCA GGAGCTGTGT CTGGTTTCTG 
ATGGGCTTTC GTCTTGAGTG GGGGCCTCAG 
CTTGCTGCAG TGGGGT C ACT GACACAGCTT 
GAGGTTCAAG ACTTCGTAAT ATTTCATGCA 
TGAACGCTGT GAGCTCCTAT GTGTTGTATG 
AAAAAAGAAG GCTGATCAGA TGCAGTTAAA 
TATTCTTTGC TATCCAGCCC TTATTGAAAC 
GTTTTTTTCC CAACAAAATC CTCTGAATGT 
CTAAAAGAAA GAAAGTGATA AGCCTTGCCA 
TGGCATAACA AAGG CATTG A GAAGCATCAT 
GGACTGTAAA AGGTTCACCA CGAAGTACCT 
GGAACACAGA AACCTTGCAA AAACGATCCA 
GAATTTCTAC GTCCTGGTGC AGAATTCCAC 
TCATTTAAGT TTGCAAAATT T C ACAATTT A 
TGCCACCCAG GAGAGTGAAG CACAGGTAGC 
CTGTAGGTCA TTTGCCAATA CGACTGAAAA 
GATTGCATTT CTGGCATGAA GACAGAAAGG 
TATTATGATT GCATATATTT TCTTAGGCGG 
AAATAGC AT C AGGGCAGCCC TAACTGAAGA 
CAAATGAAAT ATTTTTCAGG TAGACTGCAC 
GATTATGCGT GTTGTTTATC TTCACGAGAG 
ATTCTTGGTT CCTTGGGTAA TTTTCCACGG 
TGGGAAGTTG GTACTTGCCC ATTTCTTCCA 
AGTATGCATA TGGATTGTGT GCTCGTGGGC 
AATGAGTTCA AAGGGAAAAA TAAGGTAGGC 
TGTAAGGTGC ACAGAAGTGC GTGTCTGTGT 
TTTTAAAGGA TGATTTCACA TGTGCTCCTT 
CTCCCTCAGC AGCAGTCCCA GGTCTTCTTA 
CCACCATTTT GCCCAAAGCT CCCTCCTCCT 
CGTTTCTAGC AAGCTTGACT TTTCTGGTTA 
GAAGTTGTGA CCACAGGAGT GACACAGGAA 
GGCACTGAGT TCTCACTTTT AC ACC CAGAA 
AAAC CTCACT CTGACAC C AG CTTGATTCCT 
TCTCCCAGGG TAAGCTTCGT TACCAAGCAT 
AAGGGAGATG GACAGACCCA TTCTGCTTGA 
TTTACATGTT GTAAACATCC TT CTTTT CAT 
CTCTGGTGCT GTATATTTAA TTTTTTTTCC 
CTTTGCGTTC GGAGGCAATT AGGAAATACT 
CAGAGCACTG TGTGAGGGCA CACTGCTGGG 
GGATTTAAAA AGTCTTGATT TTATGCCATT 
GATTGTGCAT TATATTT CAG TAACCTTTTG 
ATTAAGCCAG GTGCCTTACT AAATATATAG 
AATGAAAACA AAGACGTGAA GGCCTTTTCT 
GAAATCAAAG TTTTCTCAGC TGTGTTGCAG 
CTGTTACTCC TATAAAAACG TGTGAATACC 
CTACTCTCAA CTACCAAAAC TGCCACAGCA 
ATGTTCAGCC AAGTTTGGAC AGTATGACAC 
TTGTCATTCA AAACAC CATT CCAAACAGGG 
GCAGGTGTCC TTGCTCTGGA GACAGTTCCC 
CCTTTCCTGT CTTCTCTTAC CTAACTGCTG 
TCATCTGATG CAAC CTAGAA TGCAAAAGGT 
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63151 ATGAACTAGG TAAATGTTTA AGACTGCAGT ATTAAGTAGG CATTTGAGAG 
632 01 AAATCTCTGT CCTTAAGGTG CTTCTTGGAA GAT CAGCAAA CCTCTCACCG 
63251 AGGTAATGCT T C AG AT AATG CTACAGACTT TCCTGTTTGC GTCTTCTGTG 
63 3 01 TCAGAGCCTG AAACGTTATT GCAAATAGAT GTCTGGATAA GAACAGAACT 
63 351 GTTAAAATCA CCTTGCCATG CCATATAAGT TCCAATATTT TGCCATTTTT 
634 01 TTT CCTGGGC AGGGAACATG TTGAAGAAAG TTTTTGAGTT CTGTTGGAAG 
63451 TCTTTCCCTT TTGAAGTCCC TTGCAGTATT CATCTTTTCC TTTTCCTTCT 
63 501 GTCTCTTTCA ATAGACAGAG CTGCTGAGCA C CAATTT AT C AGATTGTCTT 
63 551 TCCCCTTCTT TAGGGACATG TGATTCTGGG GATAGAAGAC AGTCAAACTC 
63 601 ACTGTGCCAA AGGAGTTACC GTCTTCCATA TTTGTGCTGC TCTTAAGCTC 
63 651 GATGCGATAT TGACTGAAAT TCTGTGGTTT CCCTTTGTTG TCTTTAATCT 
63 701 ACACCAATGG AGTTACACCG AAGTGCAGTT TTAGAT CTAT GAAAGCAGTC 
63 751 TGGAAG AT CG AATATTCCGT GTCATTCCCA GAACGTGGTC CAGAACATCT 
63 8 01 GTCGCTTGGC ACCACCTTTT CCATTCCTGA CTGCATAGAT CAGCTAACAG 
63 851 CCCTACGGCA ATTGCAGTTA CTCTGAACTG CTAGGAAAAT ATTTGCAGTC 
63 901 ATCATTGTAA GTGATGAGTG GGCACATAGC AGTATTTATG TAGGAGGCTA 

63 951 AGTACTTAGA GTTTCTAGGA TGATCTCAAC CTACAGGACC GGACAGCTTT 

64 0 01 CTGGAGAGTT CTAGCAAGGG TAAGGAGAAC AGGGAATCAC CTCTTAGAGA 
64 051 GAGGACATGC CACAGCTAAA GCTTTAATGA ACAATTAGAT GTGAAGCAAG 
64101 AGACAGGAAA GATGATTGTG AGACTTTTAA AAGCCTATCA AAGCACTAGG 
64151 AGAGC C C AAA GCATAGGCAA AGTACCTTAT AAGTTGGCAC ATCTGAAGAG 
64 2 01 TAT C AATTAA AAACATATTA AATCCATATG TTAT CCGATG TGATT CAATA 
64251 TGTGTGGGTC ACCCTGACCA ACCCAGATTT CTCCACGTAT GTCTGGTAAT 

643 01 ACTGGCTCTA CGTAGCACGC AGAACTG C C A GCTGTCACTT GAAGGTAAGG 
64351 GCTTCTACTG AGCCACTCGC ATTACCTTGG TTGGGCATGG ATGAGAGACT 

644 01 CCTCAAAAGC TGCTGGTGGT GTCTGAGACT GGGCAGGATT GGTCAGGCCT 

644 51 TTCTCGCCTC CCAGCGTAGG TTCAAGCTGC CCAGTCCCCA AACTGGTGTC 

645 01 CAGCCTCCTT CAGCAAGGAA ATCAGTGACC TGCCAGCCTC ACTGCAACAG 
64551 GAG CT C AC T C TGTGGGTCAT CTCTATCCTT TTCTGTTTCA GGATGACGAT 

U gene exon 1 

646 01 GGATGCTCTG CAACTAGCAA ACACTGCCTT TGCTGTTGAT ATGTTCAAAA 
64 651 AGCTATGCGA GAAGGACAGA ACAGCCAATA TTGTGTTTGC CCCACTGTGT 
64 7 01 ACCTCCACAT CTTTGGCTCT GGCATATAAA GCTACAAAGG GTGACACTGC 
64 751 AGACCAAATG AAAAAGGTGA GCTGTCGGCA TCCTGCTGTG TAGCTGCAAA 
648 01 ATTGTCAGAG GTGGCTTTCC TATTTATTCC TCTTAATGCT GTATAGGACT 
64 851 GCTGGTTCCC TTGTAAGCCA GGCAGAAAAC TGTCCATCCA AAATTCCAGA 
64 901 ATATTTCCCC ACTCCATGGC TCCACACAAC CAAAGAGGCT GAAAAT CACT 

64 951 AGCATAGGGA AAAAAGCTTT CTCAAGCATT TACAAGGTGG ATGGGGACAT 

65 0 01 GGCAGAGTCC TCAGCAGTTG TATTAAGGCC TTGTCTCCTT TCAGCAGGAA 
65 0 51 TGCTGATTGT GGCTGAAGGT GACTGCTGAA GTCACTGCAT TTTCTGGATA 
65101 ATTGTTTAGT GATTATT CAG GACTGCCTAA GCTTAACAGG ACTGGAAATA 
65151 ATTTTGCCAT TACCAAGTAA TTTTAGCAGT TCTGTCTGTG CCATTTCCCC 
652 01 TTTCTCCTGC CATACAGCTA AGAGGAAGAT AATGCAGTAG GAGGCAGCTC 
65251 AGCTTGAGTA GTAGTTTGCC TTGCAAATAG CTCTAGATGC TCAAGGGTTT 
65301 TACAGCACCA CGAAGCAGCA TCATGGTGAT GGTGCAATGA GTTTATCAAG 
65351 GTTGCT CTGT GGCGGTGAGA GGCTGCACGA CTGCCTCTGT GAGAGCCAGG 
654 01 ATTTACACAG CCTCTTTTTA TTCCAGTGCC CACAGTCTCA GCAGTTACCT 

654 51 AGAGGTGAAT G AGAAG C AAA TTCAGCATGC ATTTATATGC TGATT AT CAC 

655 01 CTGGCTCTCA GGGGCATTCC ATGTATTTGA ATACATTTTT CTTCGTTTAG 
65551 CAGTTCCTCC TTGTACCTTT GGTTTCCCTG ACGGCACATT GCTGGAGCAC 

656 01 AGCCTCTGGC GCCTCTGCTC ATCCTACAGA TTGCAATGAG TCTATTTGCA 
65651 CAGAAACAAA GTGGTATATC CACAAAGGCC TGCTGGGTGT TTTCCCAAAT 

657 01 AGGATTATTT TAAAAAAATA AAAATAAAAA TGATTTTTAG ATCTTATTTC 
65751 TAGTTTAAAT GACACCCCAA AGCTTCCTTG TCATTTCAAA GTTCAAGCAC 
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658 01 TGTCTTTGCA ATGGAAGAGC TTAAAACATT AACCTGTGCT TAATTTCACT 
65 8 51 TTCACTTGTG CCTGCAATTT GCATTGAACC GTCCCACAAT AAGTGAACAT 

65 9 01 CCACATCCAC AAATAGGGTT CTGTTACACA AGTG CACTT A TGTTTCACAT 
65951 TTCTCAAGGT AATTTACTGT GCCTGTAAAG ACATGGTGTG TTCAGGGAGA 
660 01 AAGAGCAGGA GTGAGGCTGA AAGGGAAAAG GAGGT CACTG ATGCTGGTTG 

66 051 GGAAAGATGA GAAGGGTTGG GCAGGCTGTT TTTAATGGAA CATGCACTCT 
66101 C AG AG AC C TT GCAACAGGCA GGCACCTAAA AGCAGAGAGG TTTAGGTCAT 
66151 GCTAGAATAT CCTGGAACTG GGCATGTGAT TTCCCGGAGC TGGGAGGTGG 
662 01 GTCAGCAGCC TTACCTCTAA CTTACGTTCT GTCTGCCAAA GCTCACCTGC 
662 51 TTATCTGACT GATTTCTACT GAAATACCAC ATGACAT CAT GTGTCAATAA 
66301 TCAGAAAACC TTGCCATATG GTAAG CAGTT TTTAAAGAAG TAACC CACTT 
66351 CCAGAAAGGA AACTAACTGG AACATTTATT TATCTGGCCT CTAAACTCCA 
66401 GATTTTTGGA CAAGAATGTG AGTTTGATAA AAGCATGACT CCACGCTGCA 

664 51 GATATGTAGT TCACTAAATC ACTTTGCTAG TATGAACAGC TCTATGGAAT 

665 01 TCTTTGGACT GCTCACAGGA AGGAAACACA TTTGGTTAAA GTTTTGATAG 
66551 GATCAAGTTT T T AG ATTT AT GTGGGGATGT CAAATAAATT AATTTTTTTT 
66 6 01 TTAGTAATAA ATAAGAGTGA GAAGTCGTGT TGTTAGCTTG AACACAAAAA 
66651 AGT CAAAGCT CTGGTCACAA ACAAGCATTA TTTATTGCCA AGCTGT CAGG 
66 7 01 CCTGGAGCAT GTCCAGAGAA GGACAACAAA GCTGTGAAGG GTCTGGAACA 

66 751 CAGATCTTAC AGGAGAGCAG CTGAGGAAAC TGGGATTGTT CAGTTTGGAG 
668 01 AAGGAGAGGC TCAGGGGAGA CCTTATCCCT CTCTACAACT GCATGAGAGG 
668 51 AGGCTGTGGT GAGCTTGGGG CTGACCTCTT CTCCCAGGTA GCATTAATAG 

CR1-L 

66901 AATGAGAGGC CGTGTCCTCA AGTTGCACCA GAGGAGGTTC AGGTTGGATA 
66951 TGAGGAAATT TTTCTTTTTC TGAAAGAGCA GTGAGATATT GGAACAGGCT 
67001 ACCCAGGGAG CTGTTCAAGA AC TGTGTAC A TGTGGCACTG TGGGATATGG 
67051 TTTAGCGGGC ACAGTGGTGG TGGGTTGACA GTTGGACTAG ATCATCCCAG 
67101 AGGTCATTTC CAACCTTAAT GATACTATGA TGCTAT GAGT TTTTAGATAA 
67151 TAAAAAGAAA GGTGCTCAGT ATTTTATCTT GTTCATTATC AGGTGCTCCA 

U gene exon 2 

672 01 TTTACAAGAC GTCAAAGATG TTTCTTTTGG GTTTCAAACG GTAACTGCAG 
67251 ATGTTTCCAA ACTCACCTCT TTCTTTGCAC TGAAAATGGT CAAGCGGCTC 

673 01 TTTGTAGACA AGTCGCTCAG CCCTACCACA GTAAGTACTG CAGAAAAGTG 
67351 CTTGAATTGC TCGACCAACC AGACTTCAAT GTT ATT C AAA ATACGTTCTC 
67401 TCACTATTAG CTTTTACTTG ACT AGACT CA GATGATGAAC AGCATAATAA 
67451 GAGTTTGTAG GAGGATGATT GTTCTGCTTG ACCCCAAGCA ATGCAGCCAC 

675 01 TGCTAGAGTT GCAATTCTTT CATTAATATG TTTTAGGTCA GTAGGCGCAG 
67551 TAGGTTTTGA ATGCAATATG ACTTCTATGC CACATCAAGG GCTTTGCAAT 

676 01 ATAAGTATGA CTGGGAAGGA TTTTAAATAA AGATGGTGGT GCAAGTGTGT 

67 651 CTAGTCCACA CACCCAAGTA ATTACTGCAT AAAGAGTAGT TTTCTTAATC 

677 01 TAACTGAGGA GGCACAAGCC TGGTTATTCA AACAACACAA GTGAGGAAAG 
67751 TGTTGTTTGG CCATGAAACT TAAGGACCTT GCAAACAACT GAGAAAAATG 
67801 TTGTGTTTGT TTTAT CAGAG TTGCCTTTGA ATAGGGCCCC AAGCAAGGGC 

67 851 AACTTCAGCC TAGAAGTGAT GTTT CAGAAG ACTCACAGCC TGCTTGAATG 
679 01 GTGTTATAAT CAGGTTGCCT GCTTTTTGGC CCCATCCACA GCAGTGAGCA 
67951 TCTCACCTGA CAAGGATAGG CACACTGTGA GCAGCCTGTG GCCTTTGTCT 

68 0 01 CATCCCTTTC TTTTGCCCAG GTGTAGACTG AAGGCTACTT TATCCTTTCA 
68 051 AACTCAGGCA ACATGTTCAC TCCTGCAGTA CGAAAGGTAC TTTAGCAGCC 
68101 AGTATAACTG TATTGAAGAC AGTCTTGGGA GCAATCTGCT GAATGCGGCT 
68151 GCGTGTCCTG GCTGTCACCT GCTGTTACTT ATTAGCTGTC CTTTGTAATA 
682 01 TACTCTCTGC CTACACCGTA ATGAAGCTTG GGATACTGGT TTTGTAGGCC 
68251 GTGTGGAGAG T CAT CT AGTG AAGAACATCT AAGGAAGGTT AGCTTTGGTA 

U gene exon 3 

6 83 01 CCTTGTGTCT TTCAGGACTT TGTTAACTCC ACAAAGAGGC CTTTTCCATC 
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683 51 AGAGCTGGAA CTAGTGGAGT TCAAGGAAAA AACTGAGGAA ACACGGCAGA 
68401 AGATCAACAA ATCTCTCTCA GAGCTAACTG ATGGTGAGTA GGGCCTAACC 
68451 TCGGGGATGC TGATTACCTC TTTGAAGAAT GATGTCTTTG TCTTCATGAC 
68501 ATCTCCTAAC TATTGCTTTT AGAAGTAAAT ATACAGTGAA AG CAAAGGGA 
68551 CTGCACCTAT TATTTGGATT CATGAGGATT AGCTGTGTTA GCATGTTTTA 
686 01 AAATCATTTA CTTTACTACT GTGGC ATTTC TGGAGG CAGA CCTTACATTA 
68651 GCCTTTGGCA AAGCATCTCA TTTGTTTTCA TTGGGAAAGT TTGGCTCCTG 
68 701 GCTGCAGAGC TTCACAAACA TCTGACATCA ATACATCAAA TCCTGGCCCC 
68751 GTTCTCTAAT GGAGAGTATG TGCTGAACTC TGAATTT C AG GCTGTTAATT 
68 8 01 AG TAG CT CAT CTCAGCAGCA CAGCTGATTT TGAC CACAGG TGGACATGTG 

68 851 TTTCTTACTT GGAAACACTC CCGTGGCAAT AGTTCTGCAG CACTTTTCCT 
68901 GCAGTAC CAC TGAGC CACTA AGTCACAAGA AGTGCCTCTC AGTGAC CAT C 
6 8951 AAGGCTCCCA GGCAGAACCT GCCCAGTCTG TGCAGGGTAG AGGTCTGGTA 
690 01 CGCAGTGCCC AAGGCAGAGC TCATGTACAT GCTGTCCATA GGTAGCTCCA 

69 051 GGGTTGGTTG CTGCCTATTG CCCTCATGTG GTACACATAT GAAAATATGG 
6 9101 GTGCCTGAGT TACATCTGCT CCATCCCGAG GTGACACAGG TGCCCACAGG 
6 9151 GAAGTACTTT GCGCTGCCTG TGTGATTTGT GCATGAATGA AGACTAACAT 
69201 CCACAACACT GTGGATTCAG TGCCTCATGA CAGTGTTTGA ACAGACACAA 
69251 AATAAAGCAA GGGAAAGAAT TACGTTCTCT TTTTGAAATC CATGGCACTA 
69301 TTTGGTTATG AACTGTAATT AGATGGTTAG CGGCATTTCT TATTCGGGTT 
6 9351 TATTCTTATG TATCACTCCA AAAGT GAGT A GAAGCTAAAC TGGAACTTCC 
69401 CTTGAAGTCT CGCTCTCCAA ATGAGAAATA TTTTTTTCAG TTCTACCTGC 
69451 TGAATTTCGC TGAAGTTTCA GTACCTTCTT TAAAGTACTA AAGAAAAGCA 

695 01 GTAGACATAT TTTTTATTCT GTTTTATGTA AACCGAGTAA AAATGTCACT 
69551 TGGAAGATCT GTCTTGATCC CAAATT C CAT TTTAAACATG GAGCTGCAGC 

696 01 TAAGGAACTA AATGCTTCTA TTTGGGGATT TCCCTTTATA ATTAAAACTG 
69651 CTATCTGTGA GGTGCAGGGC AGAAATATTT TAATTCAGTA CAGTGTTTCC 

6 9701 ATGTTCTGTG AAAACAGCAC ATATGTTGAT AATTTACTGT AT TAATGAC C 

U gene exon 4 

69751 AGCTTAACCA TCTTCACAGG CAAAATGGAG AATATTCTGA ATGAGGACAG 

698 01 TGTAAGTGAC CAGACT CAGA TCCTCCTAGT TAATGCAGCT TATTTTGTCA 
69851 CAAACTGGAT GAAGAAGTTC CCAGAAGCAG AGAT CAAGG A ATGTCCTTTT 

699 01 AAAGTCAACA AGGTACGTCC TGAAATAAAA TAGAGTACAC CTTCTACTCA 
69951 GATGAATGTT TGCCAATTTT GTGCTAAGGA AATTTCAGTG AGAGCAAGTG 

700 01 AAAAATATTT GTTACTACTA TGGCATTCTT AGACTCTCTG TCAAAACCTA 
70051 TGTGCTGTTG CAAAAGTAC C TAAGCCAGTT TTCTTGTTAC GTTGCTAGTT 

7 0101 TGAAGCTGTT GGTGAAACAA GCACTAAAGG TCACCGATAG TAGGTAATTC 
7 0151 TTTCCTTTAA AGCACATCCC CAGTATATTG TATTAAGTAC ACCTTGTCAC 
70201 ATGAAAACTG CTCCCCTTAA AGT AC CAAC A GCTTTCACTA GCAGTCTTAC 

70 251 AGCTGATATC GTTACTTACA GAAGCCAACA AATT CCATGA TGGTAAT CAA 
70301 TGTACCACTT TCATGCAAGC TTGCAAAGTT TCCTCTCTCA TCTTCTCTGT 
70351 GAATTAAAAG GAGTGCTAGA TTGTCTCCTC TTGTGTTTTG CAGACT GAAA 

U gene exon 5 

70401 CTAAGCCAGT GCAAATGATG AATCTGGAAG CTACTTTTTG CCTGGGTTAT 
70451 GTGAAAGAGT TGAATGTTGC AATCCTTGAA CTTCCATGCC TTAACAAACA 
7 0501 TATAAGCATG CTCATTCTGC TTCCCAAAGA CATTGAAGAT GAAACGACTG 
70551 GCCTGGAAAA GGTGAGAGAA AAAAACAGTA CTGAGATGAT GCTTTCCATG 
70601 CACAGCTGTG TCGGTTAGCT GTGGGTAGCT TGGGTAGGGA CTGTCTTCCT 
70651 TGAATTCCTT CATTGGGTTG TTGAGCTGAT TACATAGCAA ACGCTTGTGA 

707 01 AGAACCAGTA AT CAG AGTAT GCACATTTAG TGGAGTTT CT CTGGAAGTCT 
70 751 ACTCTATAGG TTAAATAATC ATTATATCAA TATAACTGAG AGTGTAAGTT 

708 01 AACTCTGAAT GCTACAAGCA AAAGTTGTCT TTTGGACTTT GTTTTTTTGG 
70851 GGTTTGATAG GACTGATGAG TTCAGAAATG GTCTTTTTGT TCCCACTTTC 
70901 TCTGGACTGC ACATTAATTT CCTTTGTTCT TTATGTCCTC AGCTGGAAAA 
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7 0951 GGCACTCACC 

71001 CCAACACCAA 

71051 TATGACCTGA 

71101 TGAGAGTGCA 

71151 TGTCAAAGAT 

712 01 GAGTCTCTAG 
71251 TAAAGCTGAC 

713 01 ATGTGATTCT 
71351 TATGAAAAAG 

714 01 TGTCCTGACT 
714 51 AGGGTAATAC 
71501 TTATTTAATC 
71551 ATTTGACTAA 

716 01 TGTGACATCT 
71651 ACTAGGAGCT 

717 01 AT C CAAAAC A 
71751 ATGTGGACAT 

718 01 GTGATCCAGG 
71851 AGCTGAACAA 
71901 TGCAAGAGGC 
71951 TCAGTTTGTA 
72001 TCATTCTCCT 
72 051 GAACTCTGTT 
72101 TAGAAATCAC 
72151 TTTTCTGTCA 

722 01 AACCTCTGGG 
72251 ACACTGTAAT 

723 01 TTTCCTTTCT 

723 51 CCTAATTACT 

724 01 TAGCACATCC 
72451 ATAGGAACAG 
72501 ATGAAGAAAT 
72551 TAGCTGACAA 
72 6 01 TTCAATTATA 
72 651 TGGTTCTTCA 
72701 ACCCTTAATA 
72 751 TATTATGGAT 
72 801 ACATGTAGCC 
72 8 51 TCTGTGAGTC 
729 01 GGATACATTG 

72 951 AACCAACAAG 

73 0 01 TCATCAGGCA 
73 051 ATTTCATACT 
73101 ATAGCTTCAG 
73151 CATTGTCTGA 
73 2 01 ATCACTGCGG 
73 251 GGTTGGCAGC 
73 3 01 TGGTTGCCTA 
73 3 51 GTCTAGTAAT 
734 01 GCTGTGCTGG 
73 451 TCTCTTTTTC 
73 5 01 CACTCTCAGA 
73 551 AATCCAGCTC 
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U gene exon 
CCTGAGACAT TATTACAGTG 
AGTGAATGTG TTTCTTCCAA 
AGCCACTCCT GGAAAGCCTC 
TCAGATTTCT CTGAGATGTG 
CATT CAT AAA GTCTCCTTGG 
AGGTACCAGG ATAT CGGATT 
CATCCGTTTA TCTTTTTGTT 
TTCAGGCAGA TTCTGTTCCC 
ACCATAAATT TATGGTGATG 
ATCACCTTTG AAAGGAATTC 
AATGTACTCT ACATATG C AG 
CCCTTAAGCT GAAGGATTCC 
GAAGTATTCC ATCCTCATCC 
TTTTCCAAAA CAAAATGAAC 
GGAAGAGGCT CTGGGGGAAA 
CCTTCCCTTG AGACT CATAC 
GTGGTGTGAT AGCCCTCCTC 
GTGCCCTGCT GGACCCTGCT 
CTCAGAGTTT GCATCTGTGA 
AATAATAAAA CACCCAGAAG 
GGTTGGGGAG GTTGCACTCT 
TCAGACATGG CAGAGGTGAC 
GTATATATCC TGAGAAAAAG 
TTGGATTAAT ATTGAAAATG 
CTTCTGGATT CAGCATTAAG 
ATT CAAGAAA GAAATAAAGG 
TAGAAGTCTC CAAATTCCAT 
AT CTAAAGAC TGCCTGCTGC 
GTT CAGAAT A AAGCACTACA 
AGTGTCAGTC TGTTCTCCTT 
AAATTGGTAG ATTCATTCAG 
TAAAT CTGTG ACTGAATTGC 
TTAAGGAGTA TAGTACTATT 
TATCTTCAAA GAAGGGGCCA 
GATCTGAAAG AACCTAGAGA 
CATATTCCTT TCTTCTCATA 
TTGGAGAGAG TTCTGTTTTG 
TTCAGTCACA AACGTTCAGG 
TCTAATGCTC TCTGACTTAT 
GTTCACATCT TTGAAGGCAC 
AGATGAGTCC CTGCAGGCAT 
CT CTGAT AC A TAGATAGAAG 
GTCTGAGAGG AGACAGCCTG 
TGATAAATGA GTGTCTGGAA 
TCTGTTTTGG AGGCTGCGGT 
TGTCCCCTGG TTCTCCCATT 
CGATGAGCAG CTGGGCAGTC 
CAGTCTGTGC AGTTTCTCTG 
TGCAGGTATC CAGCAAGCCA 
TCTGCTAGAC TGATGAAAAT 
TGCCTGGATA AGACTTATTA 
TTTTCTAGAT CACGAAAACA 
ATACTCCATT TAAATGCTGA 
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GACCAATCCC AGCATGATGG 
AGTTTAGTGT GGAAGGCGAT 
GGCATGACAA ATGTCTTTAA 
TGAAACCAAA GGTGTGGTTT 
AAGTAAATGA ACAGGGTGGA 
CTGCAACACA AAGATGAATT 
TAGGCACAAC AAAACTCGCA 
CATAAGCAGA GAATATTAAT 
CATGTTCCTG TAAAGCTTGG 
TAAGAGGTTC ATATATCAAC 
CAGAACTAGT TTATTTCCTT 
CACTGTGCAG AACACATGAT 
ACGAGAATAT TTTGTTCCTC 
AGAGAACCTG TTTTTGAAAG 
GAGCTGCATT CCTGTTTCAT 
TCACTGCCTA AAGGGGGAAA 
TTGTACTTGG CTGTAGTCTG 
AATGCACGGT GAAATAGTGC 
AACAGCAGCT GCAATATGGA 
ACTCTTCAGT GTGTGCTACC 
ACTGTGTGGG ATTTTTTCAC 
CAGTTCACTG CAGCTGAGAG 
AAGGCTGTGC AGTTCTAGGA 
CCACACTCTT CAGATACAGA 
GAGGCTCCAC ACGTCTACCT 
CTTTGACTTG AGTGAGATTA 
ATGAAATTAT GGTAATTAGT 
ATATGTT CAG TACTGATTTA 
AAACCTGTGT CAAATGTCTG 
CACTCAGCTT CCAAGAGGGG 
GACACAGTTA AAATAAATAT 
CCTTTTGGAC CACACGATAA 
TGGTCAATAT ATAGAGTGAG 
TTTTAACTGA GTATTCCCCT 
TTTCTAAATG GGAACATACA 
GCAGAGAACA GCACAGTGGC 
ATTCTTGGCG TTCCCAACCC 
GTTTCACCAG TTGTCTTCCC 
CTTAATAT C A GACAGTTAGT 
TGAGGGGACT AACTCACCAA 
CTGGGGGTCT CTGCTGCTCT 
AATGCTATGT GTAAGACTTA 
CAAAGACCTT TACTAGCTCA 
AATGTTTTAA TGTGTCCCAT 
TGGAGTTATT C AG AGCTGT C 
GGTCTGGGCA TTGCACGTGG 
TGTACATAGG CAAGGTGGAC 
TGTTTCAGCA CTGACTCGTG 
AATCACACTT CATTGCTACT 
CTCTGTTAGA CTGCCTCATC 
AAGGAGAAAA GCTGATATAT 
TTGCAGTGCA GGAGC CATT C 
TCAGAACACT T AGTG CAT C C 
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73 6 01 ACGTACGTTC CCAGAGAGCT CCTTGTTGGT GCCTTTGCCA AGGAGGGCTA 

73 651 TCCCATGAAA GCACACAGGA ATGCTGCCTC CGGCAGAGAC CTGTCTGTCT 

737 01 CCACCTTCTG ATTAACACAA AGGTACCCAA GACTATGCAA GGCAAGCTGA 
73751 TATTTCAGAT AAGAAGT TGC CTCAGTTTAC AGAAGGCCAT GAAAAGCTCT 

738 01 GGTTCATTTT CCATATCTGC TTCTTCTCTC TGCTTTTGGA AAATAATTCT 
73 8 51 TATTTCCTAC TAGCAAGTCC AGACTAGTGT AATATTTAGT CTTGATTTGT 
73 9 01 TAAAT CAT CG TGAATTTTAG CTTTTACTAA TGGTAT CTTA GATGATCTAA 

73 951 TATACTAACA CCTAGTAAGT GACTT CAGAT AAGGTGTTAG GCTTATACCA 

74 0 01 CAGCCCAGTT TGAAGGAGCT AAGTGCAGAT GGCAACCAAA CAAGCAACAA 
74 051 CACAAATAGG CCCAAATAGC CCATGGAGGG CCAGAAAGTA CAGCTGCAGG 
74101 CAGAGCTGTG ATGTAATCCA TTTTTGTGAG CCCCTTGAAG CCAGCAGGCC 
74151 AGCCTCGTGC TTTTAGTGTA ATGGACATCA T AC AG C C AAA AAGGAAGGTT 

742 01 ATGCCATAAT CCTTGCTCCA TTCAGCAATT GGACT CGAGT GT ACT AG C C C 
74 251 TGTTTCCTAG ACCCTTCCAG TCCTATGGAA ATTTAGAAGT CCCAATGTAA 

743 01 AACCTATTCA ACTTGCAGCA TGTGAGAGGA AGATATCAGA ACAGCTCCCG 

743 51 CTAACGTGAG AAATGCCATT ACAGGGTACT GTACTGTTCC AC CCATTCTG 

744 01 GCCTCTGGGC TGTGGGCAAG AAACTGCATG GGAGGACATG GGAAAAC CTG 
74451 TGTGGGAGCT TGCATATAGA AATGTATTTG TACTCAAAGG CGTTGGCTGT 

745 01 GACGGAGAAA GTGAGCATAG GTGAGGACTT GCTCAGATCA CCAAGCAACC 
74551 CCACCTCACC TTTCAACACA GCATTACTAT CCAATGATGG GTAACTGGCC 

746 01 TGATGGAAAA AAGCAAGCCA TGGCTGGCTT TTGCAGGCAG TTGCAAACTG 
74 651 CAGTGTTATG CAGTCTAGCC ATCT CATAAC TTGCTTAGCT GCATAGT CAT 

747 01 TGCCCACTTC CTGCTCCCAG CATTTTGTAG AAGAGAACTG GTACAT CTAA 
74 751 ATGCTTTGCA GTAGCAGGAA TTGGTTTTGG AGATGAGCAG CTGGTTTTGA 

748 01 AACTTGAAAA ATGCCACATA CAGGCAATTG GCCTGGCTGG AAAAGCAGTG 

CRl-GG 

74 8 51 CGGTTTTGAG CCTTAGGTAT GTTACATGTG C AAGTGTAGA GTCCTACACC 
74901 TGGGGAGGAA TGACTGCAGG TACCAGTACA GGTTAGGGGC TGAGCTGCTG 
74951 GTGAGGAGCT CTGTGGAAAA GAACCTCGGT GTTCTGGCGG GCAACAGGTT 
75001 GGCCATGAGC CAGCAGTGTA CCTTTGTGGC CCAGAAGGCC AATGGTATCC 
75051 TGGGGTGCAT TAAGAAGAAT GTGGTCAGCA GGTTGAGGAA GGTGATCCTC 
75101 CCTCTCTGTT CTGGTGAAGC CACATCTGGA GTACTGTGTC CATTTCTGGG 
75151 CTCCTCAGTT CAAAAAAGTC AGGGAGCTAC TGGAGAGAGA GTCCAGCAGA 
75201 GGGCCACAAA GATGCCTGGG GTCCTGTAGC ATCTCCCTTG TGAGAAAAGA 
75251 CTGAGAAACC TTGGGTTTTC CAGA CTGGAG AAGATAAGGC TGAAGGGGGA 
75301 TCATATCAGT GATTACAAAT ACTTAAAGGG CAGAAGCCAA GTGAATAGGG 
75351 CCAGGCTCCT TTTGGTATCC TGTGACAGGA AATGGGCGAA AATTCAACAC 

CRl-b 

754 01 CAACAAGAGG AAGTACTTCT CTACTTTGAG GGTAACAGAG GACTGGAACA 

754 51 GGCTGCCCGG AGAGGTTGTG GAGTCTCCTT CTCTGGAAAT ATTCAAAACC 

755 01 TGCCTGGATG CTTTCCTGTG CAACCTACTC TAGGGAGCTT CTGTAGGAGT 
75551 GAGTTGAACT CAGTAATCTC CAGAGGTCCC TTCCAACCTC TACAATTCTA 

756 01 TGATTCCATC CTAACGGCCT TAGAAGGGTC AGA&TTTGCA CATACGGTAT 
75651 AATGTTCTGA GGCAAAGCAG TGAAACAGAT TGCAAAGAAG CTCTAAGGAG 

75 7 01 ATATAGAGAG CAAATCAAAG AAATGAGTGG AGGAACGACG TCAGTGTAAA 

757 51 AGGGAGGGAG AAAGCACAGA GTTTGGAAGA ACGAAAGCAG TGAAAGGTAT 

758 01 TCAAAAATGG CACTGAAAAG TGGCTAGGAC TCACAAAAGC AG CAGAA GAA 
75 851 AAAATGGAGA ATGGAATGGG AAAGGCAATA GGCAGAAAGA AAGAAAAAGA 

759 01 TAAAGAGGCA GGAACAAATG ACTAAGAAGT CTGAAGAAAT ATGCAGAAAG 

75 951 GAAAAGCAAA CAACAAAGCG AATCCAAATG GAACAGAAAA AAAGTGAAAA 

76 0 01 GAA GGAAAA T ATTACTGAGG AGATCTGATA CTGTGTGCAG ATTTGTGCTT 
76 051 CCCATCTCTT TCTTTGCATC TCACTGCATG TTGGGGACAT AACCCCATGT 
76101 GATCAGCCTC TCTTGGACTC CTATTCTTAG CTCCAGTGAT TTAAAAGAAT 
76151 ACCCCCTGGG TGGGGATTCC TGTTACAGAT CAAGGAAATA CTTCTTCCTC 
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762 01 TCTCTAATTA AAATCCTTCC 

76251 GCGTGCCTAC TACCTGTAGG 
76301 TTGCTCTATT TCTGCAAGCA 
76351 CCTAGCAAGA ACCCTGCACA 
76401 TTCAAAAGGA ACTACACACA 

76451 GCTGTTCTCC AGCTAAGGGC 
76501 AGGATGCCTC CTGTCAAACT 
76551 GCACATTGCT AGATATAGGT 

766 01 GACATTTGCA TGAAATTACA 
76651 GACTGAATTT TCCCACAAGA 

767 01 CAGGAAGGTG CTCCAGAGCT 
76751 G CT CAAGAAG GTAAACATTA 

768 01 GTTGTTGTTG TTTTGGCCAA 
76 8 51 TATAGATGGG GCAAAGGGGG 
76901 GACACTGCTA AAATTTGGGT 

76 951 GGGTGAGGCT GGGAATTTAG 

77 001 ATGCCTGTCT AAATTCCTCA 

77 051 CCTGCCTGCT TGGAGAGCTA 
77101 CATGCAGATG CTTTGCTGCC 
77151 TAGAACAGCC CAACTCACTC 
77201 TGCAG AT CAT CTGGATAAAC 

772 51 TTTCCTTCTT TACTCAGGTT 
77301 TCACTAGACA AGAGAGGGCA 

773 51 TCCTCTGTAC CCTGGTCTCT 
77401 TGGCTGTATA TAATAGGAAG 
77451 AATGGGGATG CTTCACTATC 
775 01 GTT CAAAAAT TTGCTGAGAA 
77551 TAGCTTGCCA CTGTATGGCC 
77601 TAATCAGCCT AGAAAGAAGC 
77651 AAAATAATAG AACCAGTCAG 
77701 CATGAGGGGC TAAGTTGCTT 
77751 AAAGATATTT TGTGCTGAAA 

778 01 TTTGGAAATG CTTTGTGTTG 
77851 GGAAATTTCA TTTTTCTTAT 

779 01 CAGAAACACT TTGAAAGTTT 
77951 ACAACCATAG GCTTTTCATG 
78001 CTTTTCTAGG ATAGACACGT 

78 051 TTATGAGAAA TAAATGTATA 
78101 TATGCAATTG GAATTATCAT 
78151 CTTTCCCTTT CTACATATAG 
78201 TCAGAATCTG CATTTTATTT 
78251 ACATACCACA TCTTTTAATT 
783 01 TCAGTAAGTT TTGCCTTGAT 
783 51 AGAGAACAGA GCGTGAACAG 
7 8401 GAGCCATGGA AGCAATTTGT 
78451 TATCTACGGA AGGAGGCTCT 
785 01 CAACGCCAGC AATTCCACCT 
7 8551 TGCAAAGCAT CTCCCTTCAT 
78601 CTCTGCACAA TTGCTAATGT 
78651 AACAGCTGTG ATCTTTCTGG 
78701 CTTCCCACTC CAAGAT CTTT 
78 751 ATCTAGGTGT TCTCAAAGTT 
78 8 01 CCAACAAATG ATTACAGAGA 



30/79 

ACTCACAGAA AGCTGGCTCT GTACCTGAAT 
MAR (0,81) 

CAACATAAAG CTCATGCATT TCCTATTCAT 
AGCTCAGCCC CAAACAAGGG ATCTCTAAAT 
CCCCAGTGTT CAAGTCCTGA CACCACCAAA 
GCACCACAGC CATGGATCTC CAGTGTTAAT 

GACTTGGCTT TGCAGTCAGG AGATGTTGCC 
AGTTGGGCAG CTTTGAGCGA AATGCTGTTA 
TTCCTGGTCT TCTGCAGGAA ACTGAAGGAT 
ACGTGCAGCC TTTATCAACA ATTGGCTAGA 
AAGTGGAAGA AATTTAAAAT AGAGTATACA 
CAGCTGTTGT GTTCTTCATT TGACCTCCTT 
TTTTTCTCTT CAAAAATAAC TTGTCTTGTT 
ATCAGTCTAA AAGTTGGTAA ATTTCATGTT 
AAGTACTTTC ACAGGCTGGA GAGAGCAAAA 
GGTCTTCACG AAGGGAGGTG GTCTTCTCTG 
GAACACATGC C CAAAGCT AT GAATCTAAAG 
GACTTTTGAC TGAAATTTCC CTCGGTTCTC 
TAACTGCCAC AGACTGAGTG GTTTATACCA 
TACATCTCCA GAAGGGTCAA AGGGCTGTTT 
TAAAAAAATG GGCTTTATGA GAAGCGATGG 
T CAC C CAT AA ATTAATAGAA ACAGGTTAAT 
TCCACAGCAC AAGGAAAAGC CTTGAAATGT 
CGCAACTCTT TGGTTGCGTG CTTGGGTGTT 
GCTGCTAGGA TTGTTTATGT TCTTAAACAA 
GGTGGAGTAT TCTTCAGATT TTGTTTTGGT 
AACATATTTG CTCCTGGCTT TGGCAGCGGT 
GTTTATGTAA CTACACATTG G C AT AAC AAA 
AATGT AC AT C CATTCCTGTT CAAGCAGGAA 
AGGAAAGATA CATCCTGGAG GTACCGATGC 
GAAAAGCCCT TTCCTGTATA AAAACAGCAT 
GGGAGATGGA CTTGGCAACA CTTCTCCTGA 
TGTATTGGGT TTTAATTTAA AG C AC ATTGC 
CATGGGGAAG ACT CGAATTT CTGCGTTAAA 
GTGTTGTGTC CCTTTAAACC CAAAAAGCCA 
TTGTATGAAT GGTCATGAAA AATAACTTCT 
TGAGGACACT GTATTATCTG TTGTGTTCTC 
ATCATTTCCG CCAATTCTCT CTTCCTTTGC 
TTAAAGCACT TAAATGAGAA GAAGAGTAAG- 
G CAGC AT C AG GGAAAACAGG TTTCTTCTTG 
AACTGCCTTA CAAACCAGGC TACCACTCTT 
ACTGCCTCCT CCTGTTGACT GCATAATGTA 
ATGATAG CTT TGAGCGCAGC TTTTCATTCT 
TT CAT CTTT A GCCTAAAACA AGCTCTACAG 
CTATAGAAAA GGAGTATTTT TCACTTCACG 
TATCCTTACA AG ACTT CTGG TATACAGTGG 
TTTCCTGGGT AGATCCCTGC TCACATATAA 
CCCAGACTGT TAACAGCTAC TGAGCCATCA 
CAGCCATAAA ACCACAGCCC TGCTTGCTGC 
TCTGTGCAAA CAGTTTGTCT GGTGCAGAAC 
AACACTTCTT TTGATCTTGT ATTTTCTCTC 
T AAAAGAAC C ATTTCCATTT GTTGCCCAAC 
ACTCTGCCCT CACGGTGGCT CCAAAACTCA 
T C AT AAGCAT GGCTTAATGA TGTGGAATCA 
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78 851 TACCTACACA TACTCCCTCT 

789 01 CCTTTGGTTC TCTAGTAAGA 

78 951 CCTCCACTGT TGCATTGCTT 

790 01 GAAGCCATGG AATTGGTGGC 
79051 CAGTCTGCTG CTTTTTCTAA 
79101 TGCCTTTAAG ACCTGTTTCC 
79151 GTTGGTTCTT TCCCTGTGGG 

792 01 GAAGTT C ATT AGCTCCTGTT 
79251 CACCTACAAC ATGCATCCTG 

793 01 TCCATTTTAG ATCTTTTACC 
793 51 ACATGAAAGA AGTGCTGAGT 
79401 AACGAGGGTG GAAGAAAAGA 
79451 GGTAGGATGA CTGCCCATTT 

795 01 ACTTCAGGGT GAAACAAGTG 
79551 TTTCAAGTTA GGTAAAAGGA 

796 01 ACCTTCATAT GTTCAGCAAC 
79651 AGCCTTTTTT CTTGTGGGCT 

79 7 01 CAAGCTAAGC TGCTCGCACA 
79751 GCAACTTCTT ACACCCTGTT 
798 01 ACAAAGATCC TGCCTGTCTC 
798 51 CATCCTTGCA GTCAGTGCAG 
79901 CTGAAAGCTA GAGCCACCCT 
79951 ATATAT TTTC CCCTAAATAA 

80 0 01 CTTATGTCTG CAAGCAGCCA 
80051 TCTGGTGTGC TGTCCAGAAC 
8 0101 GCTTTTTATG GTACAATATT 

8 0151 CCACACAACA GCTTTACAGT 
8 0201 TAAAAGCCTT TTTTTTTTTT 
8 0251 GCTGAAACTT AGTCCCAAAT 
8 0301 TAGGAAAGGC ATCAGACTGC 
8 03 51 ACAGATCTTG TCCTGCCATT 
8 0401 GCTGAGCTAT TGCTTCGTTG 
8 0451 CAGTGGTTCC TATCAAATCA 
8 0501 TGCATGTTTC CATAGGTAGT 
8 0551 ATACATGAAA ATAATTACTA 
8 0601 AATCCTTCAC ACTGCAGCTC 
8 0651 AGTTTCTGTC ACCCCAATCT 
8 07 01 AACTGATGCG ATGACACACA 
8 0751 CTGACTGATT TAAATACCAC 
808 01 GACAGAC C AC GGGAAACTCT 
8 0 851 AGATGGAAAA ACAATTCTCT 
8 09 01 GGGCCCCTTT CCAGATCCTG 
8 0951 GTGTGCAAAA TCAGCTATTT 
810 01 TTAGCAACCG ACTATGGGAT 
81051 TGATACAGAT AAGCAATCAG 

81101 TCCCAGTAAG CTGCAGGCTT 
81151 TGAGTTTTGC CTTGATCTCT 

812 01 AAAACATTGT CTTCTCTCCA 
81251 CTTCTAGGGG CACGAAACAA 

813 01 TGCTGAAATG TTCTGAGATA 
813 51 CAATGCTGGG CTTTGCAGCA 
81401 CACAAGCTCA ACTGCTGCCT 



31/79 

CCAAATATCC ATTAAGAAAG TTCACTAAAT 
AAGTTCCTTC TCCAGCCCAC ATCCCTTCTC 
TTCTGGGGCA GCCCTGTAAA TAGCTCACAT 
AGTGGTTGTA CCTGGACGTC ACTCTGAAGA 
AGG CATGG AC AC CT CTGTAC GCCAGACGCT 
AGCTCTCATG CTCTCCCTCT GTGCTTGGTG 
TTGGGGTGGA GGTGCCTCTC TTCTGTTGAG 
GTCTCCTCGA CGCCTTCTGA GGTCTAGACA 
AC CT AC AT T C ACAGTAAACA ACCTCTTAGA 
AGCTGTGAAA GTGGAGCAAC ACAAACTTTA 
TTTGTTTTCA GAAGGTTGTG AATAATAGCT 
GAAATGATTA CTGCAATGTG TTTTTCTTGT 
ATGTTAGGCC TTCATATGAA GTACTACTGG 
TCTTAGAATG AAACATATAT GAACTTTTTA 
AATAAATGCC TGCACTTGCC AC AT AT CAGC 
TTGACTTTCC TGTCAATCTA TCTTAGGCTA 
GAGTTCATTC CCATTGTCTG GGACTTGCTG 
GACAACTTGC TGCACCTCAG C AG AG C CAT A 
AACTTTGGTG CCTGAGCCCC C ACTTGT CAT 
AC AC CTGAAT GAGAGGCAGT GTGTGTTCCG 
GACGCTGAGT AGTTCTTGTC CCAGAGCAGG 
GACCTGAGTG CTTTCTCTCC ACACTGTGCT 
AATATCTTTC TGGAACACAG G C CACAGTT A 
AGAGCATATG CTTTGCTTTT CTTACATATT 
ATCCTTTGTT TGACACTAAA ATTGATGTGT 
TTGAGAAAAA CTTGAGTACT CCACTGCTAT 
MAR 

TATTTCCCTA AAGGACTGAT AAGGGCTTCT 
TTCAGATGGC ATTCTTCATG AAAAGACCAA 
TCTTCTTACC AGAGTGGATT TAATGGCCCA 
TGTATTTACA GTACAAGAGA AAAGAATGAG 
GAACAGGAAG CTTACAGACT TTCTGGGGCT 
TGAAATTGCC ATTCGTTATC CATTCTGAAT 
ATGAGGAGAC ATGAAGTATA CTGCAAACAG 
AGCATTCATA GCTGCTTACG TTCCTTCTTC 
GTAATTTTAC TTT CATGAAT CTGTTGTTTG 
AGGTTAC CAG ATGTGGTTAG ATGCCCGTGT 
GTCTCTAATC ATGTTGTTAC AAGAGGAAAG 
TTAAACTAGT TTGTAGAAGG AAATCCACGG 
AACCTTTTGC TTACAAATAA GAACAAGACA 
TTTGGAAGGG AT C AGAT AC A TTGTGGGATA 
CTAAGGAATT CTCATATGGT ATGAGTATTG 
CTGTATTCAC ATGAGTGTGA ATTAATAGAT 
CAAACTCAGA ATTCAGCACA CTTCTACTAT 
GATTTTAGGG CGGACAGATA CTTCACAGTA 
CTGATTCACA TTTCTCCTTT CCCTTTTTGC 

V gene exon 1 
CACAATGGGC TCCATTTCTA GAATGATTAT 
ACAATAAACT CAACAGAACA GCAAAAGGCC 
ATGAGCATCT CTACCTCCCT TGGCCTGATC 
CACTGCTGCT CAGATAGAAG AAGTAAGTAC 
CTTCCACATA GCCTGCTGTT CCCCCAGTGG 
CAACATGTGT GCTTAGGAGA CAAAGATAAA 
TGAGAGCAGT GCTTGGTGTG CTGTGATCCC 
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814 51 TGCTCACTTA TCAACTGTGA CATTCAAACG ATT CAACATG TCTCACCTAC 
81501 AGAGCACACG GAGCCTGGGG GTACAGGGTG GGCATGCAGA AGTCTGTTCC 
81551 TCTGGTCACC ATGCCTTTTA CTCCCTGCAG TGCAAGCTGT ATGCTCTGAG 
81601 ATCTTTTATT TCTTTTCTTA TTTGTTTCTG AGAGCAGTAA GTGACCAATA 

816 51 CTCCTAAGGT ATATGTGGCA TAAGG CAGTA GCTGGCTCTG GCTGTGTCCT 

817 01 GGTGGATCTT CATCCATTGT ATTATAATAT TGCCACAGGT CAGCTGCTGC 

817 51 CAAGGGAAAC TCATTCTCCT TATGAGGTTC TGAGTGACTC TTGCTTAGTT 

CRl-c 

818 01 TAGGAAAGCA ATGGAGATCG AGTACTCTCA ACAAGGGGGA ATGGCGTCTA 
81851 ACTAAAAGAG CGGAAATTTA GGTAAGATGT TAGGCATATA TTCTTTACAC 
81901 AGAGGGCAAT GAGGCACCAG CACAGGCTTC C C AGAGAAGC TGTGGTGCGC 
81951 CATCCCTGGA GGCGCTCAAA GCCAGGTTGG ATGGGGCCCT GGGCAACCTG 
82001 ACCTGGTGGT GGCATCCCTG CCCACAGCAT GGGGTTGGGG CTGAGTGGGC 
82051 TTTGAGGTCC CTTCCAACCC AAACCTTTCT ATGACAGTTA ATAAAT CTAC 
82101 ATCACTTATC CAGGACAGCC CAGTAAATCT TTCAAACAAG GAAAATGCCT 
82151 TTATCCCAGT TAAAATTGCC ATTAATTTGA CCTCTTCAAC TGCAGGTTCT 

822 01 CCACGTCAGC AATGC CGCAG GAACTACAAG CCTTGAATCT GAGCTTGAAG 
82 251 GTGCAGTGCC CGAAAACAAG TCTGAACTAA GCCAGGAAAG AGAGTCTTCC 

823 01 CCCTCTCTGG TATGTCTTTT TTAGTACAAG AGTCTTTCAC T C C ACAGTAG 
8 23 51 CCTATTAGTT GTAAAGCACC ACAGCCTGCC ACAGGAGGGA GTCAAGATCC 

824 01 CATGCACAAC GTCTGCCTGG TCTACTACGC CTGATTGAAG GTGTTCCCTT 
82451 GTAATCAGCC AAGTCCTCCA TAAAGT CAAA TACAAAGCCC CCACCAGAAG 

825 01 GAAGAT C AGG TTACAAAACT TAGATTAGCT GAATTTAAAT ATAATTACAG 
8 2551 TGGGAGCTAG CCCTACACTG CAATCTAATG AGG ATG CAAA TGAACAACCA 

826 01 AAGCTATACT GAGGAATACT TGTAATTGGT GTGTTTGAAA TATTCCTAGT 
82 651 GCAACACAGA TGGGAATCTT AACCACGAAG CGTTCCATGC ACTGCTTTTA 
82 7 01 CAACTACAAA ACCTTGGCAA AGACTATGTT TTAAGC CTGG CTAACAGCCT 
82 751 CTTTATCCAA CAAGGATTTG AAC CG CAT C A GGTAAGATAA CTGTACCTTG 

828 01 TAACCTCTGT GGCGCTGACC CCCAGCTTTC TGGCAACCAT ATGCTTCACT 
82 8 51 GTTGTCCCTC CATGTGTATT TTTGAGCATT GGAGGTGCTT CTTGGAGCCA 

829 01 TATCTCTTAG GGTTGTTGGG AAAGAGACAG AAGTATCAGC TTTCAGTGCT 

82 951 TCTGTTTAAA ACAAACAAAC AAACAAAGTC AAGACAACAC TCTGTAGAGC 

83 0 01 AAAAATAAAG CAGAAGACCT TTGACTTTTG G CAT AT CTAA CTTGAGCCAG 
83 051 AAGTGCGACT AC AG C AAAAA AATGGCCTAT TCAAGCTGTC TGCAAGCTGC 
83101 TTCTGGGCTA TCTTTCTATT TGCAGCTTTG CATTGCTGGC TTTCCTCTTT 
83151 TTCTTCTTTC TTTCTTTTTT TTTTTTTTTC CCCCTGCTGA ATGATTTGGA 

832 01 TACTTGAGAA TCACCCAACA CATCTTGCAT CTTCTCTAAT TTTTTTTTCT 
83 251 TTTCTATTTT TTTAAATTTT TATCTGGATA CCTGCATACT TCAGGTATGC 

833 01 AGTTTTCTGT GGGAAGACAT TGT CATCTAG AGGCAAAAAT GTATATAAAT 
83 3 51 AATAAGAAAG ACACAATAAT AATCTCTTTT T CAAAGATT A T CTGAAT CAG 

834 01 CTTCTGATAG TTGATGTTTC CAAAGCCAAA TTTTGTCTCT TTCAGTCAAG 
834 51 AAGACCCTCA GAATTTCTAA AACGTTT CTG AATTGTTGAC TTCATGTTAA 
83 5 01 AGAGAATAAG CTCTGAACAG GTTTGGCTAA TTCACAATCT TTATTCTGCT 

V gene exon 2 

83 551 TTACAGAAAT ATCTAATGTG CAGTAAGGAA CTATACAGAG CAGCCCTTGA 
83 6 01 AACAGTGGAC TTCCAAAGGG CTCTTGAAGC AAGCAGGCTA AAAATTAATG 
83 651 ATTGGGTTGA AAGCGAGACA CAAGGTAAAA CAGAGCAAAA CTGTAGCTGT 
83 7 01 GCTATCTTCT CCCTCTTCCA GTGCTCCTTC AAAAAGAATT CAG CAT ATG A 
83 751 TAAGTCTTGT TCATGTTTCT AGGTTTCTCA TGCCCGTCAA AGATAGTTTG 
83 801 TTGTTCCCAA TCATTCTTTA GAGTCATCTA CCAGCTAAAC TATTTCTGAG 
83 8 51 TTAAAGATGT GTTTGTTGTC ACATACTGTC ATACTCCTAC CCACATGCCT 
83 9 01 AGCAAGATAA CTGCAACAGT ACCTCTAAGG GTTAAATAGA TTAATTGCTC 

83 951 CTGCAAATAG CCAACACTGC AGGTACAGTA AAGCAGAGGA CGGAAGTTAT 

84 0 01 GAGCGTCACA GTGAGACTGG GAACAGCATA GCAGAGAGAG AAGACACCTG 
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84051 AGGACCTGGT GTTGAC CTGC TCTGGTCGTA CACAGAGCAA TGCTAACAAA 
84101 GATGAGTGAT GTGCCCACCA GAGAGATTTC ACTGTTACAA GTAACAACCA 
84151 ACCAGCTTTT GCCCTTTACA GGCACATAGA GGTCATTGGC TTTTTTTCTG 
842 01 ATTAAGCTGA ACATGAAATA TGCCACTTTT ATTTTGTCAG AGATGCAACA 

842 51 TCAGCAGGGT GAAAACCTTA TAAATCTTCC AGCTGAACTT AAGCCAGAAC 

843 01 TTACTGAGGG AAATTACTGA TGGATGAATA GATTTGAAGG CTTCTGATTT 

843 51 CTTAATGGTC AT AT C CTGAC CAAACCTGTC CTTGGGCTGA CAGAGCAGCC 

844 01 TGTGACTAAT GTGGGAAAGA GCTGCAAACC CCAGACCATC ATTGCTCTGT 

844 51 GTGCCTGTAC AAAGCCTGCG CGCTTGGGAA ATCCTACTTC ACCTCTGTAC 

845 01 AGAAAAAAAA AGGGTAAAGG GAAAGATGCC CTCATGTAAA CTGAAACAGA 
84551 GGATTAATGG CGCTGCGCCT TTTACTGTGG ACAGGTGCCA CCTGGAACAT 

846 01 TCATTTTGCC ACTGATCCCA CAGTAGGCTA ATTTGATGAT CGGTGCCCCT 
84651 TCCTCTCCCT AACAGGCCAG TACTAGGTAA CAGTGCTGAG AAATTTACCA 

847 01 TTTCTTTGCT TGTATCGTCC CTGTTCTGTG AAGAAACAAA CAGTTGGATT 
84751 TCTAAGGTAC TCTAAAGCTA AGTTCACAGA CAAGTAATTG AGTCTCAATC 
84801 CAGAGC CTTA ATAACAACTA ATAAACACCT GTGTTTTCCA AAATTTCCTC 

V gene exon 3 

848 51 CAGGTAAAAT CAAGGAACTT TTTGCTCCAG GAGTGATTGA CTCACACACC 
84901 ATTCTGGTGC TGGTGAACGT GATCTACTTC AAAGCATCCT GGGAACACAA 
84951 GTTTGAGGAG AAAAATACAG TACAGAGAGA TTTTAAACTG AATCAGGTAG 

85001 ATATGCATTG TATAAATCTT AGCATGATTT ACCTGAGTTA GCATGATTTA 
85 051 CATGAGTTGC AACGACT CAG CATTTTGTTT CAATGGCTGA CAAAACACAA 
85101 AGCTTCAGCC CTGATCAGCG CTTTTGAACC TAATAGTCAC TATGGGCAGC 
85151 TGTCATGGAT AGAAGC CAAT TGCAAAGATC TCATTTCACA CAGGCTCTGT 
852 01 GGGGCCATCC TGGCTTTTAT GCATCCCGTA CAATTCAGCG TGAGCCATGC 

852 51 AACAGATAGG TTAAACCAAA CCAATCAAAA AAAGAGGCCA GATATTAACA 

853 01 AGC CACAT AT ATGAAGATGG AATTTGAAAC AGGAAAAATC CTCACAGAGT 
85351 GTTTTGGTTT ATTTATAGTA TCTGCAATGT TTAAAAGGTT TTTTTTAAAA 
85401 TATTTTTTTT ATTTTGATTC CTTTTTTCCA CCGTACATAT AAAATGGAAG 

CR1-GG 

85451 TTTTCATTGC TCAACTAAGG T ACAGAATCA TAGAATTACT CAGGTTGGAA 
855 01 AGGACCTCAA AG AT CAT C AA GTCCAACCGC AGCCTAACCA TAGTACCCTA 



85551 


ACTCTAACAA 


CCATCTGTTA 


AATCATATCT 


CTGAGCACCA 


CATCCAAACG 


85601 


GCTCTTAAAC 


ACATCCAGGG 


ATGGTAACTC 


AACCACCTCC 


CTGGGGAGCC 


85651 


TATCCCAGCG 


CTTAACAACC 


CTTTCTGTAA 


AGAAGTGTTT 


CCTAACGTCC 


85701 


AACCTAAACT 


TACCTTGGCA 


CAACTTGAGG 


CCATTTCCCC 


TCGTCTTGTC 


85751 


AGCTGTTGCC 


AGTGAGAAGA 


GACCTACCCC 


GCTCTCACTG 


TAAGCACCTT 


85801 


TCAGGTACTG 


GAAGAAAATA 


ATAAGGTCTT 


CTCTCAGCCT 


CCTCTTCTCC 


85851 


AGACTAAAAA 


GCCCCAGCTC 


CCTCAGCTTC 


TCCTCGTAGG 


ACTGATTTTC 



8 5901 CAAGCCCTTC ACTAGCCTTG TGA AGCTGCA AAAAGTTCTT TAACAAC CAC 
85951 ATTAATCCAA GCTCTGTACA GCTCAAGTCT AACAAATGTC TTCAAAAAAG 
86 0 01 ATGATCAAAA CCATTTTATT TCATTTAATT CAGTTTTGTC TTC ATT C CAT 
86 0 51 ATGCTGTGCC TATGTTACAC TAAATAATGA AGCCGCCAAA AAAATGAACC 
86101 CACAAAAAAC ACAGATTTAG CTCTGATCTG AAGTTGAAGA GCTTTGTATG 
86151 GGAAAAACTG TATT CTAAGT GTTTCTTATC TATACAAACA AAAGGTCAGA 
862 01 AAGAC AT CTG TTGCTAGCCG TAGTGTTGCA CTGCCATTTA TTAAGACACG 
8 62 51 TAAGAAAGTG TAATTTTGGT CCCTTAATTT TTTTACTTGA AATATGTCTT 
86301 TGAATTTGAA TACTGAAAAC TGACCTTAGG TAGGAACATT TGGAACACTG 
86351 CTGCAGTCAC AGAAACTATG AGATTGGGGG AATCTGCATA TACTTTTCTT 
86401 C ATG C ACT AA TTAATAATGT TCTCTACTAA AATTCTTCCG CTGATTTAGA 
8 6451 AGGTAAGTAA AAACTTAGCT AATGGTGAAA TGAACCTTGA GCCTTTACAC 

865 01 AGGATTTGAA CAAACTCATC ACAAAAGAAA ATGAGGCTTA G AAGAC CTAG 
86551 AAGAACATGC CTGAGATTGC TCTTAATCTG TCTATTGCTT CCTGCCTAAA 

866 01 ACATCTACCT GATAAATGAC AACCTGATTC CTGCAGTGCT ATTTCTTCTC 
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86651 TATCCCATTC CAAACCAGGA CTTGCAAATC C CATCAGC AT CAGCTTGTTT 
86 7 01 GGCTGGAGAG TAATGGTATT AAGCCACTTC ACTATCTGAT CAGTTGCAGG 
86 751 GAAATTGCTT TGTTTTATTT TGCCCCCCAG AGAATTATCT CCTTTATACA 
86 8 01 TGAATGGCAA AACTGATGTT TTACGTGTCG TTGTATGTGC AACAAAATAA 
86 8 51 AGAAAAAATG TTTAGCTTTA TAACAATTAC TGCTGCAAAC ACAGACTACT 

869 01 GATATTGCAC CTGAAGTTTA AACATTAAGG TCTGTATTGC TTGTGTGATC 

V gene exon 4 

86 951 ATTCCAATTT CTTTTTAAAT AGAATGAGAG AAAGCCAGTA CAGATGATGT 

870 01 AT C AGAAAGG CACATTTAAA CTAGGCTATA TTGAAGAGCT GGGAACTCAG 

87 051 GTGCTTGAAC TCCCTTACGC TCAGAAGTTG CTTAGCATGA TCATCCTGCA 
87101 CCAGGAGAGA CAGCAGATGG ATCTCCCAGT GGGGCTGGAA CAGGTAAGGG 
87151 TGAGGACTGC GGCTAAGCCG GACTGAAAGC TGGTTGTCTG AATTAAAGCT 
87201 GGG CAAAAAT CTAAACTTGT TAATTTCCCC ATCTTCTAGA CTGAAAGCAC 

V gene exon 5 

87251 AATGACCTAT GAAAATTTAA TGCTGTGGTT CTCTTCCGAA CATATGTTTG 
87301 AGATGGTGGT AGAGGTGTAC CTGCCCCGAT TCAAGCTCGA AGGCACCTTT 
87351 GACCTCAATG AGGTATTAAA AGCAATGGGA ATGACTGACA TCTTCAGTGA 
87401 ATCCAAAGCT GATCTTTCTG CATTGTCATC TGAGAAATCC CTGGTGTTGT 
8 74 51 CAAACATTGT CCACAAGGCT TATGTGGAAG TCAATGAGGA GGGTACTACA 

875 01 GCAGCAGCTG CTACAGGAGC TACCATTGTG AGGAGGTCTC TTCCCCTCAT 
87 551 AGAGGTGTTC ATAGCTGACC GTCCTTTCTT ATTCTTTATT AGGCACAATC 

876 01 CCACCAGTAC CATTCTTTTC TTTGGTAAAT TCTGCTCACC TTAAAATCAA 
8 7 651 GGCCATCTTC TAGCATTGTG AGAAAAACCT GGATGAATCA GAAATACTAT 
87 7 01 TTTTCCCCCT ACACCTTCTT ATTCCTATGA ATGATTGTAG ATCAAAGTAA 
87751 TCACTGCAGC CAACCTAGCC T AGAACC AT C AATTGAATGC CCTCCTGTTA 
878 01 TGCTCCTTGA ATGG CAAAT A TTGATCTGAA TCTAAAACAG GAGTAAGTTT 
8 7 851 TCCCTTAACC TGACTGGAAA TCAAGAATAT TTTGTTTCTT CAAGGCGTAC 
87 9 01 ATACACTCCT GTATAGCCAA GTATGTCCGG CATAGCCAAG TAATGTAGTA 

87 951 CACTATTTGC CTGGCAAAGG TAGAATTTGT ATGCTGCTAC CTGAGGAGAA 

88 0 01 CTGTTTGTAA CAATTTTCAG TAACTGCCAG TAAAAGTGGA GTATTTTTAT 
8 8 051 TTTCTCTGTA GTTTTTGATT TCCTGCCAGG TGGGACTTGA TTAACAGAGA 
8 8101 GGGGCTTTGG AAATGCTTTA TACTTATACA TAAT CTGTAT TTGTGGCAAA 
8 8151 TCCTTCGCAC AGTGGAGATC TCACTTTGAT AATTCCCTTT CCTGTAGCAG 
8 82 01 CAGTCACAAG CAAGCAGGAA ATACTTATTT ACAGCAAATT CACGTGTTTA 
88 2 51 CTGACAACTG TACCACCTTT CCCCCCATGA TGTATGCTGG ATCTATCCTT 
883 01 TTGCCATATA AAACGTTTAT GCTAGAAGCA GCTTTGGTTT CATTTATTTA 
8 83 51 TTTAGATATA AGCCTGCATC TGAAGCACCA ACTCATCAAC TGGAAGATAG 
8 8401 ATGGAATATG AC ATATACC C CTTTCACAAT CCCTTGGTTT TTTCCACATG 
88451 AGTTCTGTTA GAAGCACTGT ATTTTTCCTT TTTTAAGATA ACAACAGTAG 
885 01 GAACACT C AT GGAAAGGACA AGATTACGCC TCATGAACAC AT CTAGTAAG 
8 8551 AGAGTTGATT ATAACAGCAA CTGAGTATGT GGGAAGGCAA GATTTTGACC 
88601 CTCGTTTTAC AGGATTTTTT GGCACTCTTT TTTGAAAATA AATCCACCCT 

CR1-GG 

88651 T A AAGAAT C A CAGCATGGTT GATGTTGCAA GGGACCTCTG GAGGACATTT 
88701 TGTCCAACTG TCCTGTTCAG GCAGGGCAAC CATGTC CAGG GGGCTTTTGA 
88751 GAATCCCCAA GCACAGAAAC TTCACAACCT CTCTGGACAA CCTCTTC TGA 
88 8 01 GTTCCCACAA TTTTGAATGA CACCAAAGAG AATTTTGTAT GCGCAGTGTC 
8 8 851 TGCAGGAATG GGATGTGAAA ACACACATTT CTAAAGCTTA ATTACTTACA 
8 8901 TAGTGAAGTA ATTGGTTTTC TTCCTTGAGT TCTGCTCTCT GGTGAAGTTT 

88 951 AATGATCTGA GATGCATGTA TATAGATATA CAGGTCTCTC CAGCCCTGAG 

89 0 01 GAATGAAGAA AAGTTTTGAA AAGGGCAATG TAAGCAATAG AAATCACAGT 
89 051 CAAATATTAC CTGGAAAACT TTTTAGTCTG AGAGATAATT AGAAAAATAG 
89101 AATTAGCAGC TGACTGATAG AGAGACATAA CTGTTAAGTT GCTGGTTTAA 
89151 CACAAGTAAT ATCTTCCTCA CAGAGTTCTA TGTGAGGTTT AACTAACTAG 
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892 01 CGTTGGCAAC TTGTGCTTTG TGACCTATAA AAAGGCAAGT AT AC ATTAG C 
89251 TATTAGT CAT ATAATTGAGT GTAAAGCTCC ATAAAGTAAT TCATGATTAG 
89301 CACAGTTTAT GTACCAAAAG TTACCTGCGG CTCTTTGGAT AAGAAAGTCT 
89351 AGGCATGATG TTCGAGCAAG AACAGGCAGG AGTAGGACAA TAATATTCAA 
894 01 ACAACTTACC CTTACTGACT AAT CTGAAAG CACAGTACAA TGTAAGCAGT 
89451 ACTTTTCCAG ATTGTGTCCA TGTTTCCATT CTGGAGGCTG ACAGCACAGA 
89501 TTGCCTACTA AGCTATGTTT TTATTACCTC CAGGTGTCAT CACTTGGTTT 
89 551 TTACATACCC TGGGGAAGTT CTGAGCACCA CAACCTCAAA CATCAGTCCC 

896 01 ACTTCTGCAA CGACAGGAAC AGAGATT C CT GTGATGAAGC GTCGAATAAC 
89651 ACAGTGTCTT GCTCCAGTTG TTGGAGGAGA TGGTTCATGA TAAAT CTAGA 

897 01 GTGAGATTAA GACACAGATG AGGTCAAATG TCATCCAGCT AGTTTATGAC 
89751 AAATT CTAAG CAGTTAAGGA ATGTGGGAAA CAT GGC AAAG TTAGCAACAG 

898 01 TAAAGGGAGG AATTCTAGCA AACTGGCTAT AGAGCAGGGA TACTCACCCC 

89 851 CATGGATCTA GCAGTATCCC ATTGGTTTGC AGGAGGTTGC AGGTCAGTCA 

899 01 AAGACATATC ACTGATCTGC ACAGCTGCAG TTCAGTGGAG GATTGTCTCT 
89951 GTTCTACCAC TGAACTCTTC AGGCTTTATC CTCTTCATTC TGCTCTCATG 

90 0 01 CACCTTCAGT TACTCAGGGC CAATGGCATG TGTGCCTCCC ATTGGGTGAT 
90 051 CGGCTGTTGA TCATGCAGCA ATCACACACC TGCCACCTGG CACGCTGTTC 

CR1-GG 

90101 GGCATGTGTA CTGACTTA AT GGAAGAGACC TTTTAAGCTC ATCTAGTCCA 
90151 ACTCCCCTCC ACTGAAG AGG GACACCTACA G CT AG AT C AG GTTATTCAGA 
902 01 GCCCCGTCCA GCCTCCTCAA TGTCTCCAGG GAAGGGGCTT CTACCATATC 
902 51 TCTAAGCAGC ACATTCCAGT GCCCCACCAT CCTCACTGTA AAAGAATTTT 
90301 TCTTTATATC CAAGCCAAAT CTCCCTTCCT TTAGTTTGAA ACTATTTCCC 
90 3 51 CTTGTCCCAT TACAACAGAT CCTACTAAAG AATCTGTCTC CTTCTTCTTA 
90401 AGAGCTCCCT TGAGAAGGGA GCTCTTCTCA GGTCACCTTG GAGCCTTCTC 

904 51 AT AT C C AGAC TGAGCAGTGC TAGTTCTCAG CCCGTCCTTG TAGGGGAAGC 

905 01 ATTCCATCCC TTGGATTATT TTCCTCTGGA CTCACTTCAA CGTCCATGTC 
90551 TCCTCTGTAC TGAGGACTGC ACATTTGGAT GTAGTACTCT AGGAGAGGCC 
90601 TCAC CAG CAT AGAGCAAGGG ACAGGATCAC CTGCCTTGCC CTGCTGGCCA 
90651 TGCTTCTTTT GCTGCAACCT AAGATACGGT TGACTTTCTA GGCTGCAAGG 
90 7 01 GCACACTACT GACTCACGTC C AGATGC C AT CTAC CACAGT ACCCCTAAAT 
90751 CCTTTTCTGG CAGGGCTATG CTCCCTCTTT TCGTATTCCA GCTTGTAAAT 

V gene exon 6 

90 8 01 GTAGTGGGGG TTGCCATAAC CCAGGTGCAA GACCTTACCT TTGGATTTGT 

908 51 TGACCCTCAT GAAGTTCTCT CGGGCCCACT GCTTGAGCCT GTATGGATCC 

909 01 CTCTGAATGG CATCTCATCC TTCAGGAGCA TCCACTACAC CATACAGCTT 
90951 GGTGTCCTTT GCAAACTTGC TGAGGGTGCA TCAAAATCCT GTTGACAATG 

910 01 TTACTGATGA AGACACTAAA GAGTACTGAT CCCAGTACTG ATCCCTAAGG 
91051 AACACTACTG GTCACTGATC TCCATCCAGA CATTGAGCCA TTGACCACCA 
91101 CTCTCTGGGT TTGATCCCGC AGCCAGTTTC TAGTCCACTA GTCAGCACAC 
91151 CACTGATCAT AGCCACACTC GAAGGGG CAG TCATGCAAGC ACCACCCTGG 
912 01 GTATTTATTT CCCAGCACTC T AAAG C AGAG CTCTTGCTCC AGCTCATGTT 

912 51 ATTTTCTGTG TGGCAAGGAG TGAGATT CAT CGACTCTAGC AAATGGAACT 

913 01 AATGGCTCCA TGTGCCCCAG GTCTCAGCTC AGCACCAGCC AGGCCAGGGC 

V gene exon 7 

913 51 TGAGTCCCCC CACATCCAAC C CAT AAGGT C CCAGAGGACT CCTACGTTTA 

914 01 CCAGTGGTGC ACAGAGATGA GTTTAGCCCA AGTCCACCCC TCAGCCTCAA 

914 51 CTCCCTTCAA CACCTCTTCA CCAAGAGGCC CAATCCATCA CTCCTTACCA 

915 01 GCCAAAACAT ATACTTGTTT AAT AC C AC AG CCACAAAAGC CACGTGGTAA 
91551 GGT CTGAAGA GACCAAAACT GTGGTTTGAG TAAAACAGAA GGAAAGCCTC 

916 01 TACTCAGTAC CCCACTTATG ACTGAGTTAC TAGGATAGGA CCTGATTCTA 
91651 CAGCACCCCA ATACCCTGTA GATGTATTCC TTTAATTCTT CACACCAGAT 

917 01 TAAGGCTGCT GCCACCACCC ACCACAAATA AATCCTTGCT TAGGCTGATT 
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91751 ATAACTTACA CCTGTGGCTT CCACAGTCAA ATGAGATTCC CAGTGCCCAC 
918 01 CTGCGTGTTC AACTTCCTTA AGGCAAAGCA TCTTGCAGTT AGCAGAGTGT 

918 51 TAAGAAATCT TCTTGTATTT CCTTTAACAC ACGTTTATCT TCCCCAGTGA 

919 01 TGCTGAATTT GCAAATGCTT TAGGGAAAAA TTGGCAGCAA GTCCTTACAT 
919 51 AATTACTGTT TAGC CTAGAA AATAACAACC GAGGTAGAAT ACTTCAGAAA 
92 0 01 GTTTCTAATT TAAGGTTTTT TTCTTGATGA GAGAAAAGTG CTATCAGAGC 
92 051 TGTTTAGTAA TT C CAGT CAT GCATGGGTAA CTCATTCTTC TGTGTTAGGG 
92101 TTTACTGAGA GGTGAAGAAA CAAGTAGTTT CTTTTCCTTA TGAAAAAAAA 
92151 AAAAAGTGGT ATTAGAAGAA CCCCATAAAA GAATGC C AAA CATTGCAGCT 

922 01 TATGATGTGC AATGTGT CAC TCAGTCTTAC AGATGACACA GCCTGGAAGT 
92251 AAGCTTAAAA AAAATGTTTA ATTCCTAACT TCTTTTGACA CCATCTGTGC 

923 01 TGTGGTTTAT GACATC C ATT AATAATGTTT AT C ACT AAAC AACAACAGAT 

923 51 AGAGAGACCA GAAACTAAGG ATGCTGCTGT CATTTCCTTC TGATGCAAGG 
92401 T AGAAAC AT C AGGAAATTAA GGCACACTGA AATATTTTGT AATATTTTGG 

924 51 ACTAGAAG CA AAACCAGAAA CTGAGTTGCA TTTGTCTCCT GGAGTACATT 

925 01 CTACAGGTAT TTAAAAAGAG ACAAAAACCA TAAATCTACT TGAATTTAAT 
92551 TTGAAGTATC AAATGAAAAA GATGTACCTG ATTTTATTAT CCTCCACACT 

926 01 GGTCTTCTGA ACTTGACCAA TCCCACTGGT CAGTTACTGG TTTACGACTG 
926 51 CTCAAGCTGT TTGTAGCAAC TATGTTGTAC CACAAAATAT CTGAGCCATT 
92 7 01 ACAAAACAGA AGAGTCATTA GGCATTTTAT CTCCAACCCA AAGCATACAT 
92751 GCATGTTTTA AAAT CT C AAA TTCTCCTGAC TTTAATTGTG CATATTATGT 
92 8 01 TCACCAAACC TTTTAGAACC TGCCTTGTTT TTTTTGTTCT GGTCTGTAGC 
92 851 TGGGAGT CAG AGAAATTCAA CTGTGATTGG AAAAATGGTT ACTGGCAAGC 

92 9 01 TATAGAGTTT CTAAGC CAGA AGGTGAAGAA ATACTACTTT TTTAACACTC 
92951 TTGGCCTGGG ACTAGACTTA CAGACATGAT CAATATTGAA AGGCAATTTG 

93 0 01 GAGGTATACA TTTTAACATG TCCTCAGTCT GGAGTTAGCT GTGTGTCCAG 
93 0 51 TTTCCTCTCA GTGTGAGTCA AGCAATAGCA TTAGAAAGTT ATGCCCCAAG 
93101 TCTCATCCCC TCCTATTGAA ACTTGGCACA G CAC ATT CAG GCTGTAAGCC 

V gene exon 8 

93151 ACCAGGTCAC AGCCCCCTTA AAGGATTCGG CAACAGCTGT GGTTGCTATC 

932 01 ACATGGTGTT GATCATCGTT GGGCCCCTCA CTGTAAGAAA GACATTGAGA 
93 2 51 CCCTGGAGCG TGTCCAGAGG AGGGCAACAA AGCTGTTGAG GGGTCTGGAG 

933 01 CACAGGCCTT ATGAGGAACG GCTGAAGGAA CTGGGATTGT TCAGTCTAAA 

933 51 GAAGAGGAGG CTCAGGGGAG ACCTTATTGC TCTCTATAAC TACCTGAAGG 

934 01 GAGGTTGTAG TGAGCTGGGG GTCGGCCTCT TCTCTCGTGT GACTAGTGAT 

CR1-L 

934 51 AGGACTAGAG GGA ATGGCTT CAAGCTGCGT CAGGGAAGGT TCAGGCTGGA 
93501 TGTTAGGAAA TACTACTTCT CTGAAAGGGT GGTCAGGCAC TGGAAAGGGC 
93551 TGCCCAGAGA GGTGGTGGAG TCACTGACCC TGAAGGTGTT CAAAGAGTGT 
93 601 TTGGATGTTG TGTTGAGGGA CATGGTTTAG TG AGAA C CAT TGGTGAAGGG 
93 651 CGAACGAATG GTTGGACTGG ATGATCTTCT GGGTCTTTTC CTACCTTAGT 
93701 GATTC CATGA TTCTATGATC ATTACACTGG ATTTGATACT CTGTGAGCAA 
93 751 AGGCATTGAA GTGGTACAAA AAATT CAACA TTCTGCATTA AATTGTAGAA 
93 8 01 TCTGGCAAGT GGAAATCGTT TTCTATAGGC ACAGCCACGC ACT CAGAATG 
93 8 51 TGTTTGCAAT TTGCTTGCAT TTAGTCTTCT GCAAGTAATG ACTGCTTTCT 
93 9 01 GTATGCAAAT GATTGATCCA TGTGAAAAAA TCTGCTTGTG T AT CTGTGAA 

93 951 TCAAATG CAT TGCTTTATAA TGTGCATTTT GGATCATTTA TTTGTGGAAG 

94 0 01 TAAGTGTAAA AAACAGAGCC TGCAATTGTG CTTCTGCAGT ATACAAGGCG 
94 0 51 TTACTCAACT CCAGCTGTAC AGTCAGTCAG GCCCTGAGAT AAT CTAGACT 
94101 TATACTTTCC ATAGTTATTA TAATTTTGTC TCTTACTAAA TCTTTGATTC 
94151 TGCTTGTTTG ATAAAGTAAC ACTCATTTTC TATATAGTAT TACAATCGCT 
942 01 TCTAGAAGGC ATTACATCAC TGAATTCATA GGCTTTCTGA AAAACAGATT 

942 51 CAG AAAT CAG ATTTTCTAAC TGTATTTTTC CATGTATATG TAT TGGAGAA 

943 01 CTAGTGAAGA ACGTGTTTAA TATACAGAAC TACAGATAAA TCCAGAAAGG 
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94351 
94401 
94451 
94501 
94551 
94601 
94651 
94701 
94751 
94801 
94851 
94901 
94951 
95001 
95051 
95101 
95151 
95201 
95251 
95301 
95351 
95401 
95451 
95501 
95551 
95601 
95651 
95701 
95751 
95801 
95851 
95901 
95951 
96001 
96051 
96101 
96151 
96201 
96251 
96301 
96351 

96401 
96451 
96501 
96551 
96601 
96651 
96701 
96751 
96801 
96851 
96901 
96951 



AGAAGCAACA 
TGGCTTGAAG 
TTTACCATGC 
CTTTTGCAAA 
GCGTTGATGT 
AAGAAG AT TG 
GGCATGCCAA 
GAGTGGACTT 
AGAAATCGTG 
ATTCTTGTTG 
TTTTAGCATA 
TCTGAAGCTC 
AAAGGGAAAT 
ACCTAGTAGG 
GTTTTAATGT 
GCAATAATGA 
TCTCGCCTTA 
GTTGTCTGAA 
C C ATT AC ACT 
TTGCGCCCTG 
CAGCACT C AG 
TGAAAACAGT 
GACAAGAAGA 
AATGATGATG 
ACCCCATGCC 
TTTTGTGCAA 
CTCAAGAAGA 
CATTGGAGCC 
AGCTAGGGGT 
ATCTGTTCCG 
CACAAATTAT 
AGATGCTTAT 
TGCAGATACT 
TCAGCCACAT 
TTTAGGGGGT 
GGGACAAGAG 
CATTATAATT 
AAATATACTG 
GTGGTTTAAA 
AATTTGGTAG 
TTTAATTAAT 

CACCCTGCTT 
ACATACTACT 
CTGAGATATG 
ATTTTGATTG 
GGAGGGCTGC 
GCAGAGGCAA 
AACACAGGCT 
TGTGTGTTTT 
TGTATTAAAA 
TAGCATATAT 
TTCAAGGAGA 
AAC AT CAT C A 



CTCAAAATAA 
CATGTCCCTC 
ATGTGCAAGT 
TTCCCTTGCA 
GAGCAGTGAG 
AAGCACAGGT 
GCACAAGCTA 
CATGTTTTTA 
TGACTCAGGG 
CAATATGGGC 
AGATCAGTGG 
TGCCCTAATT 
TCAAATACTG 
TGAGTTCTGG 
TTAAGAGAAA 
CAGAGAACTA 
ATTC CATAGT 
TTAAGCAATC 
CAATGAAATC 
AGCAGTGCTG 
ACCACAACCA 
AGGTGCTGCT 
TGTGCTTTGG 
GGAATGACTC 
TCCAGCCTGG 
GCCATCCATC 
GCCTCATCAG 
TCGGGCTCAC 
TAGAGGAAAA 
GGAGCATCAA 
TCAGTGAGAG 
TTTGTGAATT 
TGGCACATCC 
CTACAGAGTG 
AAAACCAGAA 
AAGCTAGGCT 
TGGCTAAGAC 
ATGGCCTAGT 
GGCATAGCTA 
CAACCAACAT 
TTATTTATAT 

TTCCTGTTAC 
GCAkTTTCAG 
CAGGAAAAAA 
CTAAATAAAG 
AATTACATAT 
AGTTTGCATG 
GTGGACTGCT 
TCAGTGAATA 
TGAAAAAAAA 
TAGCTTGTAT 
ACAAAATGTA 
TGAGGAGTGC 
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GGATGTGGCA 
CAAAAAAGCC 
TTCCAAACTC 
GATGGCATTT 
GTGCTTTTCT 
CTGTGCAGAA 
TTCAAATATT 
AGTGGAAGTG 
GGTGAAGGGC 
CTGCATCTTC 
AAAAAAATAT 
GGGATGATTG 
ATGATAACTC 
CTTCCAGTCC 
ACAAGTATTC 
TTCTGCTCTA 
CTTCTCTTAG 
ACTGTGTGAC 
AGTTGCTGGT 
AGCACAGCAC 
GCCTCAAGAC 
CCTGAAGCAT 
TAGTGTGTGG 
TGGAAGACAC 
GCAACTCCTC 
CAGAGGTGCA 
GTCCATACAC 
TGCACAGTTC 
ATGTGCTGTA 
GGCTTCCTGG 
GGAGAAGTGC 
TTTCTGTTTC 
TTGAGGCGAT 
CCTCCTCCCT 
TAGCTGTGCT 
CCTGGTAACC 
TTGAAAATGC 
CCCATAAACT 
AGAGGTTGCA 
CCAGCATTCA 
TAAATAAAAA 
MAR (0.81) 
TCTCAGTTGG 
AGCTGCAGAC 
AACCCTGTAA 
AAGACTCGTG 
GAAATATAGT 
CCAATTAATT 
CTGATGAACT 
CCATGGTATA 
AAACAACAAG 
AATCAGACCT 
TAGTATGTAT 
CAGTCTGACA 



AT CCTAAATA 
ATCTGAGAGA 
TGCAGGTATT 
TGCTTTGCTT 
CATGCTGAAA 
CATCTAGTGA 
GCTCCCTGAA 
GTACATAACT 
CTATCCTCAG 
CAGCACTGTC 
ACACAAATAT 
CAAATAAATG 
TGCAGTTCAA 
CTGCTGCTAG 
ACACATGGGT 
TAG CATT CTG 
TACCACGTCC 
ACCTACGTCG 
CTTCTGTGGA 
CTGTTTGCCT 
ACTCAGCAGA 
AACAGCCTCC 
TGCTCATTTC 
AC CAGAGGCC 
CTTGCTGCCT 
GAGTGAAAAC 
ACTTCAAACC 
TGCTGAAAAC 
GTTATCAGTG 
AGAGAACATT 
GCCTCTGAAC 
CCTCTTCCTG 
TCAGCAATAT 
GAGAGGAGAA 
TGGACCTCCT 
TCAGGAGG C A 
AATTTGTTGG 
ACCTTCTAGA 
GAAAAGAAAG 
CAGACTCATG 
AAAAAAATCC 

GAAGAAAGTA 
TTGAAGAGCT 
ATTACAGTAC 
ACAGTCCATG 
CTGAATTAGG 
GGTAAAAGGA 
GAGTATGTAA 
TGTCTGGTTT 
AACAGTGAAA 
ATATAGTTTT 
GATAAGGATA 
ACAGGAAAAG 



GGCTGTAAGC 
AAATTTCTCA 
TTATTTTCTC 
GCTCTGAACT 
TACAAGAATA 
ATGTATTCAG 
AATGCAGTCA 
TCTGTAGTGG 
TTAATCCCAT 
AGACTCCAGG 
ACCCCTTGCT 
AAAAAAAAAA 
CAACCAGGAC 
GACTATTCTT 
GAGTACCCTA 
ATAGTATGAA 
CCCAGCTCCT 
GAG CTTAGCT 
AAATATACTA 
AATATTAATG 
AGGAATATTA 
AGAGATGGAA 
CTTGTT CATG 
TCTGGTGTAT 
TTTTGACTTG 
AAC CATGGAG 
CAGAGCAAAA 
TGTGATGAAG 
CAGCTCCATC 
AT C AGAAGGA 
GCTCTGAGTC 
TTATGCTTCC 
ATGCT CAT AT 
AAATATTTGT 
GCTCTGCTGT 
GAGGGAGGCA 
TATATTTGGT 
TGTGGAGTAA 
GACCACATCC 
AGAAATACCT 
TTTGATGACT 

ACCGCTGGGT 
TTCCCAAGTG 
CAGGCATTTA 
ACTACGTCTT 
AGAGTTACTG 
GAGTACGCCA 
AAAAT AG CCA 
GAGTCAAATA 
TAAACAGTGC 
CAAATAAATC 
AGTACTATAA 
GAATTCAGCG 
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970 01 TGTGAATGAA GGGGAAAGTG TGACTGAAAC AATTGTCACT CAGCTTACTA 
97051 CAGCAGAAGC AATCATTTAT GATCTTAGAT TTTTTTTTAT TTTTTTTTTT 
97101 AACTTGCTTC AGAGATATCT AAGTAATCTC AAAAACAGGA ACAAAATACC 
97151 AACGCAAGGA AAAATTCTAT TTTCGCTTCA TATAATCTTT TCTTTTTTTT 
97201 TCTAGTTGCA TTCTTACCTA AAAACAACAA CAACAAAACA TTTAAACAAT 
97251 GTTTAAATGT TTACTGCTGG TTTGATTACA TCAAACCGAG TTGTTGCTGG 
97301 AGATGAC CAG CTATCAAGGT GCATAATGGA CTGGCAGATG TGCTTGGTCT 
97351 TACCCCAGGT TGCTGTGCAA ACACAATACA CATTGACATA TAAGCTACTA 
97401 TGAGTTCTGA AGGGCAGTTT AGACATTAAT TCTACTCCAG GCCAGACACG 
97451 CTGACT AT CT GAGTGGTTTA TAGCAAGGGA CTGGTTGACT TCAAAGTGGT 
97501 TCCAAGTCAA CCACTGCCAA GTGCTTAAGA CTGTGTATGC ACAACAGAGC 
97551 TGATCATCTC CAGTGCAACA AATAACATGA GAGCAAAAAG CAT CTGAAAT 
97601 TCTGTAAATG AGGCTGTTCT GGCCACACCT TGGCTCATTA AAAGACTTTG 
97651 AGAGATGCCA GAATAGCCTC TGCTAAATGT GATGCAGATG GACAAGCTAT 

977 01 GGAATGAATG GGTCCAGGGC ATAAGGAAAC ATTACCCTCA AGCACTACAC 
97751 AGGAGCTGCT GAACAACCAC AGGAAAGGAA ATGTGAAAAT GTGAACAGAT 

978 01 AAATGTTGGA AAGAGCCGCA TTTCTGCTGC TTACTATGTC CTTGATTATG 
978 51 CCAACATTAA GGAAGAATGG CAAACCCCGT GA ATTGGTTT AGGAACAGCT 

Y:OV-l element 

97901 CTACAATGGA CTGCCTGACG GAGGAAAAGG GCAGCAGAGT CCTTGCTGAC 
97951 CTCTTTCTGG TACAA ACACA GAT CTGGAAC AGAGTTTAAC CAATTAGTCT 
98 0 01 TGCTTGCATT CATGCCTCTT GAATTT CAAG AGGTGCCTTT GATTTCCCCT 
98 051 GGCCTAACAC CCCATCTAAA ATTACAAAAC CATATTTTGT CTGCTGAGGA 
98101 CTGTGCACGG ATAGCCCGTT CTGGTCAACA TACTCAGGCT GCTTCTGCAA 
98151 CAAGTTTTGC ACTGGCATTC AGTGTAGAAA AAATGCAAGA CCTGTGTAGC 
982 01 GGC AGACTTC TCTCTGGAGA ACATGTATTG CCTCAACTAT CTTACCTGTG 
98251 CAAAACTGTT GTGGTGACTG TGCTATTGCA GAGGTAGAGT GTT CAAAGAA 
98301 GGCAAACGTA CTGAATGAGA GAACACATCA AAAACACCTT CATGCCCTCT 
98351 TCTAGGGGAG ACAGCGAAAC AAAATGTTTA TTGAGAAAAT CTTGGACATC 
984 01 AGTCCAAGAG ATGAAAACAC TGTCCATATG TGCAGGGCTG GTTGTGTTCT 

984 51 ACAGGTCCAT G CTGCATAGA TGACCACAGA GGACAAAGAC ATTGAAACCA 

985 01 AGCATACAAA GGGCTGTGGG TACCCAGGAA AGTTCTTCAA GGAAGCCTTG 
98551 AAGGGATGTT TGAGTAC CC A CCTGACCTGT AGCTGCAACC CTGATGTAAA 

986 01 CATGTGAAAA TGGGAGCATA AGAGAAGACA CTACACACTG CAACAAAACC 
9 86 51 TGTGCCCTTG GGGAGGAAAA GTTTGACAAG ATAAAGTAGA AGCTATTGAA 
98701 AAAGGAACAT TAAACAAGAC AGGAGGAAAG CTTCTTACTA TCTGTAGATT 
98751 TCCCTACTCC CGACATGACT ACTGTCATGT TGACAGATAA AAAATACTCA 
98801 TTTTGAGTGT GGAAACTGAA AGCCATTCCA GTTATCATGG T CTGCACAT A 
98 8 51 CACACATGAC TGAATTTCAG CAACACAAAA CACAGTGCTT ATGATAAAGG 
98901 AGCTCCCTTT TACCTTTACC AGTGGGTACC ACCACCACTG TGTACTGTCT 
98951 GTCTTAATGT GCAAAAATTT GGGATTTCTA TTATTCATTC CCCTGGCCTT 
990 01 AACAGAAGCT GGATTTTTTT CTTTAGTGCT CAT CAAGGG C ATTATTCAAT 

SDRE fragment 

99051 AAAGAGTAAT AGCTTTTTAC AATTGACTAA TATTTGATAT TGTGCATTAT 
99101 GATTGTCTAA CAGAC CATGA ATGTTCCTTC AGACAGATTT GGTAGTTTAT 
99151 TTACCTGTCA TAGTAAAATA GGAGGTACAG AAGATCTATG AGAATAGCCT 
99201 GTGCATGTAC AATGGGCCTT GTTGC CATGA CCTATGAAGA ATGAAAATCA 
99251 AAAGCTGACC AC CAAT CATC CCTTGAATTC CACTGGCTGT TCAGCATTCA 
99301 CTTCTGAATA TCTGAATACT CTGGAGTCTG CCTTCGCAAA G C AGCAAAT A 
99351 CTTTCAGACT GTTCCCTAAA TCTCTTCCTC TTACCTATTC ACACTGAGTT 
99401 CTCTAATTCA TCCCAACACC TCTGCTCTGA ATTTTTTCAT AAGAAGCTTC 
99451 AGCAAAATGT GCTTTCTCCT CT C AAATGTA TGCTGCAGAG CCTTTGGCTT 
995 01 ACAGTGGATA TAGCCCAAAT TCCAGTGAAA AACTTCAGTC TTGCCTAGGT 
99551 G C AGAAAT AG ATGGAGCTGT GCTTTTAACA AGTACTAACT ATAAGCTTCT 
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996 01 TCAGTTCTCA AACTCTTTCA G CAGAC C AAA AC ATTTTT CA GTACAGTTTT 
99651 GTTCTTTAAA AAACT CATAA AGCTTTGTTT CTATTCTTAC ATGGAAAGCA 

997 01 AT C C ATTACA AAATCCTCAA AATAGAATGA CCATCCTGCA GCTGACTCTG 
99751 CTTGGAACTG CATTATTTTC TCTACATCAA GTGGTTGCCA T C C ATGAGAA 
99 8 01 GCATCCCTAT GTTTCTCTGC ACACTGCAGT AAGAGAT C AC GTATATATCA 
99 851 CACTTTTCCC TTCACCCATC TTGGGAGCAG TGCTACAGTA AATTGTATAA 
999 01 TTACAGTGCC CCAGAGATGA GAAGAAACTG AACAGCAGGA AAGGAGACAC 
99 951 AGTCTTAAAA AGAAGAATGT TTTCCAGGAA TTGATGCACT TTCTTGCACT 

100 0 01 CCTTGGTAAT ATGGGACTAC TCTTGCCTCA CCTTTAGCAG TGGGTGCTCA 
100 051 TTAAATGGTG AATGGTGGTG GGTCTTCTGG TTCTCCAATC ATGT CTTATT 
100101 TTCTCATAAT ATTTTGGGAT CCTTAGATTC ATCTGACTGT GAGAATCACT 
100151 TGAT CTGATT TTTTTTTTTA ATCTGATTTT G C AG CTAAGT TT AT CTGAAG 

1002 01 TGTATTATGC TTATCCTCTT TTTTAAGGGT TTTTTTTTTT TTTAAAGTGT 
10 0251 GTGTATT CAT TATTCGTTTG GCTCTAGTTA TCGATATGGC TCAATCAAAT 

1003 01 TAATGTTTAA ATTCTGAAGT AGAGCATGAG ACATGCTAGA CTTGAAGTTG 
100351 GTACAGCTTT ATAAGATACA AGAAAAGCCT GAATAATTAC ATTCTACTAT 
100401 TAGGTTTCAC TTCACAAAAT AAATTTGGCT TTCTCCAAGT AG AGT AC C AG 
100451 TCTAATGTTG GCCTACTCAG TGCTTTCAAG CACAATGAAT CAAAAGGCAA 
1005 01 TGACAAAGGG TAGTAACTCA AAGGATGACT CTTAGAAGGC TAACAGGGGG 
100551 AGTGTCCGAA AGGGTACTGT ATATAT CACC AAGGACT CAG AGAATCTGTT 

X gene exon L 

10 06 01 CAGGTTCAAC TGGCAAGCTG GATTATTACG AGCCTCTTTG ATGTTTTTCT 

100 651 GTAAGTACTT CTCCAAATAA AATGTAACTT CTAAGTTGTA TTCTTGAATA 
10 07 01 TGGAAAAAAC AAAACAAAAC AGAAATATAT TATGTAAGAA CTTAGAGGAA 
100751 AAAAGGGCCG CCTTCTATTT TATGATGTTG GCCCACCACA TCAGAGGCAG 
10 0 8 01 ATGGTGGTGG TATGGCAGTA GAGGTTGAAC CTTCCCACCA ACACCCCGTT 
100851 ATGTGTTGTT GCTGTGTGAC AGATGGCAGC AGAGGGGCAG TCTGACAGAA 
100 9 01 TGGCGTCT CA CATGGAAGTG TGTATGAAGC AAAGGTGTGT CACTGAATTC 
100951 CTCCATATGG AAAAAAATGG CACCCACTGA CATTCAT CGA TGCTTGCTGA 
1010 01 ATGTTTATGG AGACAAAACA GTGGATGTGA GCACAGCGAG GCAGTGAGTG 
101051 GTGTGTTTCA GCAGTGGCGA CAGTGACAGT TGTTCACCTC CACTGGTACA 
101101 GAATTTTGCC AG CAGGAAAT GCAGATTCTT GTACATTGTG GGCAAAAATG 
101151 CAT AG CT AG C TGTGGTGGCT ATGTTGAAAA ATAATGTTCC GTGGCTGAGA 
1012 01 ATCTGCTCAA GGAAATAAAG TTATTGTAAT CATTATAATA ATATTATACA 

1012 51 TGTGCTTTCT ATCTATTGTA GTTTACATGA AAATAAATAG GAGGCATTAC 

1013 01 TTTTGGTGTG AT CTGTATAC AGGACAGATA TGTAAAAAAT ATTTCTGGAA 

1013 51 GAGAAAATTT TTGTTTTCAC AGTCTCACTC CCTGCAGAAC ACAGGTGAGG 

1014 01 TACAGTAGGA TAATTCACAG AGCCTTGTTA GC AC CAGGAA CCTCTCAGGT 

1014 51 TATGTAGTAG ATCACATTTG CTACAAACTA TGGATATGCT ATTATTCCAA 

1015 01 CTTAAAACTG TT TTAGAACG GGGAGGGCAC TATTCAGCTT TCTTGTTCTC 

1015 51 GGATTAAAGA AAGAGAAGGA CTGTAGATTT CAATAATTTC CCCTAAGTCT 

1016 01 TGACATTAAA TTGCATGTAC AAGACCTTCA CCTGGCTGAT CTGATGCAGC 
101651 TTTACAGTGC ATTAAGTAAT TTAG C CAGAC TGTGTATTTA CGGTATATAG 

1017 01 ACGTTTGTTT GTTTTTGTCA ACAACAAAAA AAAGGAAT C A GCAGAGATTA 
101751 AATGTCAAAA AATGAGAATA TAGAGAAGAA GCCCACTAAA GCTATAGTTT 
101801 GGCATCTAAG CAACTGGCTA GATTTACAAA GAGATTCACT CTATAAATTA 
101851 C AGGACAG CA ACCTCCAATT TTATGGCCAG TTGTACAAAG AAGCAGTTTG 
1019 01 AAACAAGCTA AGACTATTGT GGTTTGACTA CATTTGATTG AAATATCCAG 
101951 AGTATGGTCC AGAGTAGACA CAGAAGAAAT GAAATGTGTT TACATTGTCT 
10 2 0 01 CAAAAATCAT TCAGAGTTCT CTGGATGGCT ATAGGGAAAC T CAT C T AGT C 
102 051 CACACTGTAT T CAT CAT ACT GAAGCACAGA TGAAACTATC TATTTCCTAA 
102101 AGGGCAAGTA CAAGATAGTG TTTTTATAAT GAACCAGTAC CTTT CTGAAG 
102151 GAAAGTAAAC ATGCATTTGG GAAACAATGG GTCAGTCTTT AC AATATTT C 
10 22 01 TAATGATCAC AGAATTTTTA GGCTTTACAT TATTGTTTCA GCATCACAGA 
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102251 AACAGCAATG AACAAGCAGC TTCTGGGCTA CAGGAAGTAC TTTTTACTAC 
1023 01 AAGTGCCACA CGTCAACACC ACACAGTAAT AATCCTGTTT CTTTTAGACA 

X gene exon 1 

1023 51 ACACCGATTT CAGGATGGGC TCCATCAGTG CAGCAAATGC AGAATTTTGT 

1024 01 TTTGATGTAT TCAATGAGCT GAAAGTCCAG CACACAAATG AGAACATCTT 

1024 51 GTATTCCCCC TTGAGCATCA TTGTAGCCTT GGCCATGGTC TATATGGGAG 

1025 01 CAAGAGGCAA CACTGAGTAC CAGATGGAGA AGGTAAGTTA TGCAAGTAAA 
102551 TACAAGCTCA TTTTGATCCT GGTTAACAGA ACAAGTT AT C CATGAAGATC 
102 6 01 TTTGAGACTT TCTCCCCTTA AGGGGCCAGC TGCTGTACAT TTGCCACTGG 
102 651 ATTTGAACTT GGCTAGCAGA AGGACATTGA GCCATGAGGT TTGGATCTGG 
1027 01 AACTAACTTT TCACTTATTG CTTTTCACTA CAAAGGGTAA CAACAGTTTC 
102 751 TACTAAGGAG GAGATCTCCT GCTTCAGTTT ATATTATCTC ACAAACCTGA 
102 8 01 CTCCTTCCAG ATAAAATGAA CAAATTTTCA TGTATAAAAG ATGAAACACT 
102 851 CAGAAAT C AG GAGTCACAGT TCTAAGTACA GTATGGGTGT AGCTGGTTTC 
102 9 01 TGGATGGAAA AATAAGTGAA CTAATTGGAA GAT C CTATCA AAAAATGTTC 

102 951 AGAGCAGCAC ATGCAGTAAA AAAACAAACA AACAAACAAA CAAAAAAAAA 

103 0 01 CCACACAAA? TTCAACCTCG AATGAAACTT CTCAGTTCAG CCATTGGTTA 
103 051 TTTCAAGCCC AGAATTTGAA CACAAAATCC AGAGACTCTC AGTGAACTTT 
103101 GCATACTTCA TTTCTTCTTC TGCTACTTCC ATTTGCAGGC TCTTCACTTT 

X gene exon 2 

103151 GACAGCATTG CAGGACTTGG AGGAAGCACT CAGACAAAGG TACAGAAACC 
103 2 01 TAAGGTACAT TATTTTTCTC TCACATTCAC TTTTTTTTTT TTTCCTGAAA 
103251 ACTTAAAACT GTTCTGACTG TGCTTCCAAT AGGTCCAGCC CCTTCCCAAA 
103 3 01 CCCTAGCTAA TGCTCTCAAC ACATGATATG CAAATGAAAA ACTAAAATTT 

1033 51 GTTCTAAAAA AAAAAAAAAT AATGACAAAA AGAAGGCTCA TTTCACATGT 

1034 01 TGC AC CAGAA AAAGTGATAG GATAGTTGAA GGACATTTTG AGC AC CAGGA 
103451 TACCTTCCTA CATTGATAAG AACTTGCACA CTTGTAGGGC TTGCTGGAGG 
103 5 01 ACCACACATG AACCATGTGT GCTTTTCTCC TTGGTCACTT GATACATTTG 
103 551 GAAAGATAAC ACAAGCCATG CTCCCAGGGC TGTCCTCATC CACTTGGGTT 
103 6 01 CTCCAAGCAC AATGTGGGGC TTGTAAAGGA CAAGAAGATT TTTCCGTTTC 
103 6 51 CTTTTCTTTT TCCTTTCCCC CTTTCACTTT TTCCTTTCCC CCTCACTTTC 
103 7 01 TCTCTCCTTT TCCCACTTCC CTCTTTCCTT TACATTTCCC ACTCTCCTCT 
103 751 CCTCTCCTCT CCACTCCACT CCACTCCTCT CCTCTCCTCT CCTCTCCTCT 
103 8 01 CCTCTCCTCT CCTCTCCTCT CCTCTCCTCT CCTCTCCTCT CCTCTCCTCT 
103 8 51 CCTCTCCTCT CCTCTCCTCT CCTCTCCTCT CCTCTCCTCT CCTCTCCTCT 
103 901 CCTCTCCTCT CCTCTCCTCT CCTCTCCTTT CCATTCTATT CTTTTTGCTA 

103 951 GAGCATTTAG ATGGTTATGT AGAACAATTC ACAAAACACA ATCAGACAAA 

104 0 01 TCACTCACAT TTTCTGTTTC TTATCACCAA GACTGAGTGT CAC CAAATG C 
104 0 51 TATCAGTTGT ACATGCTTAT ATAGAACATC TCTCCCATGG AGCTTTTAGA 
104101 CTCTAATGTA TTTTGTTTGC AAATGTCTGA ACACTGTGTG TTTTCCTACG 
104151 TGATCTGTAC TTTATAAATA GTTGTCTTTC TAGTAAAATA AGCTAACATT 

X gene exon 3 

1042 01 TATACCCTTT TTCCTCCTCT TCAACAACCC AGTGTGGCAA ATCCGTGAAC 
104251 ATCCACCTAC TCTTTAAAGA ACTCCTCTCT GATATTACTG CAT CAAAAGC 

1043 01 CAATTATTCA CTCCGCATTG CCAACAGACT CTATGCAGAA AAGTCACGTC 

1043 51 CTATCCTACC GGTGAGTTGT ACAACAGAGT GATTTTTTGC TAGATCCTGT 

1044 01 ATAAACCCAT AATC CAGGAG TACTGCCCAG AGTATCTGTT AATCCAACTC 
1044 51 ACCTCAGCGG TGTGGACTTC CACAGCTTTT CATTTGACAT TCTCAAAATA 
104501 AAACACACAA ATATT CTAAA T CAAATAC AT TTTATCTTTA AAAATAGAGA 
104 5 51 AAAATGCTTC AAAAATAAGG ATTTTATTAT AACAAAACAG TTGCTAATGG 
104601 ATGCTAATGG ACCTGAAGCT GTTTTTGGAT TGGTATTTCT TCAAGAAAAT 
104 651 ATTTCAGCAT TTCTACTACG TAATCTTATC TGGTAAAGTA ATAAAAATCT 
104 7 01 TAAAGAT CTT AACATATCAT G CAT CGAAAT AATTTTGCTG GCCCAGTTTT 
104751 AACCATTTCG TCCAGGAAAT AAGCCATGAA AACAGTCTAA TAGCATAATT 
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104 8 01 AT AAAAAT C A TGGAACATTT TAACTGCATT TTATTTCACC TTCACAGTCT 
104 851 TTTTAAAAAC TGACTTGGTA GCTACAACTG TTGTCTTTAC AGATTTACCT 

X gene exon 4 

104 9 01 AAAGTGTGTG AAGAAACTGT ACAGAGCAGG TCTGGAAACA GTGAACTTCA 

104 951 AAA CAG CATC AGACCAAGCC AGGCAGCTTA TTAACTCCTG GGTGGAAAAG 
1050 01 CAGACAGAAG GTAAGCTCAG AGGAGAGTTT ATAATATACT TCCTTGTTAC 

105 051 TACTTTACCC AAACAACTTC TGGAAAGACT ATTCCTTCCA TCTCCATTAA 
105101 TGGATATTTC CTGTGGAAAC TGATGACTCT TGCACACTTT TTTGTGTGCG 
105151 GTGACAGTGA ATTTAAATAT ATATGACAAA GGCAGGGATG CCACTGTGTG 

1052 01 CTTTCTGTGT AAGGAGAGCA TAACTCATGC AAGATTGGTC CCAGCTTCCC 
105251 TACAATATTG GCATCATTTT ACAAGCATAT GCTGGATGGA TAAGAAATGG 

1053 01 GCTTCCGTGG AAGAAAATAA TGTGGCCACT AAGTTGGTGT AAGAAAAGGA 
105 3 51 ATGATTAAGA GTGTATGTAC ATTTATCAGG AAAAAGGTGG GAAGAAAACA 
105401 AGAATCAAGT ATTAGAAGGA AGCACAGTGA GAGGCAGAAG AT CGGT AT C C 

1054 51 CTGCTTTGCT TTTCACTTCC TTCTGTTCCA TGCAAGTCTT TTTCCAAGGA 
105 5 01 CGTTTGAGAT ATTCCTGGGG ATGTGTGTGA ACATT CAAGC CTACATGCCT 
105551 CCTTACAGAA ATGCCTGGTT AAGGGTTAGT TGTTCTGTAT GAAAT C ACT C 
1056 01 GTGAACTTGA ATTCCACATG CCATCATTTA AAGAACAGGA AGTCAACTCA 
105651 AGCTTGCTGG TTGACATCTA AAACAAAACA CTCCTGCAAT GAAAACAAAA 
105 701 CCCCACAAAG CAGCACCCTC CAATCCCTTT GCCTCATACA TGCAAACCAG 
105751 ACAGACTGTG TCTTAGCACT CACTGCTTTG CTTCCTTCTT ACAGGACAGA 

X gene exon 5 

105801 TCAAAGATTT GCTTGTATCA AGCTCCACTG ATCTTGATAC AACGCTGGTC 

105 851 CTCGTTAATG CCATCTACTT CAAAGGGATG TGGAAGACAG CATTTAATGC 
1059 01 AGAAGACACT CGAGAAATGC CCTTCCATGT AACAAAGGTA GGGGACGTGG 
105951 TCACCGCTTC TGGGCAGGAC AGAAAGC CAT CAAGGGTGCG ACATACACCA 

106 001 TCCTACAGTC ATTGGTCCAT GGTTCTTCTG GGCCCCTCGC TGACAGGGCA 
106051 TGGGGCTGAG CCCAAGACAG GCTGGCAAAA ATTGTGTCTG ACCAGGCATC 
106101 CAAAGCACAC CTGTAGACAA GAGAGGAAAA TGGAGACACA GCTTGAGGAT 
106151 CCAGCCCAGT TCCTCTGAAG GACTTGCACA TCTGCCTGCT T C AAGAG AAA 
106201 CTGCCCCCTT CTCACATTGT CTCATGCTTC TGTTTTGCAG GAAGAAAGCA 

X gene exon 6 

106251 AACCTGTGCA AATGATGTGT ATGAACAATA GCTTTAATGT GGCCACACTG 
106301 CCTGCAGAGA AAATGAAGAT CCTGGAGCTC CCATTTGCCA GCGGAGACCT 
1063 51 GAGCATGTTG GTGCTGTTGC CTGATGAGGT TTCTGACCTG GAGCGGGTAC 
106401 GGCCCTGGCA GGGGAAGCCA ACTAGTTCGG AGTT CAGTGG GAGCTGGCTG 
106451 CTGTTAGACC TTTGGCTCTG CTCTCGCTCC TTGGCTGTGC TGTGCTGGCC 

1065 01 AGG CAGGGGA GCACAACAGT GGCCCAGGTG CTTCCAGGCG CTCAGGCAGA 
106551 GGTTGGCCTC TAAGGAGAGC CCTAGCCTCA ATGTTATTAA ACAAAGAGTA 

1066 01 CAGCAAAGAA TACAAAGGTA AAGGAGCGTA GGGCTGCTGT AATGTTATAG 
106651 AAGGGCACGT ATGGGCAATT CTTTTCATTG AGAGGCAGTT TCATCTGGCC 
106701 TCTTATATAA ACTCTTCAGC AAATGTTACT AGAATTGATG AGGTTCAATA 
106 751 ATCCCTAATA TTTTTGACAA TATTCTCATC AAATATTTTA AATAAGCTGT 
1068 01 TCTCAGAATA CCAAAGTAGA TGCAGAAATA TTTGTGTTTG TTTGGTACTA 

106 851 TCCACTGTAT ATAAATTGTC ATGGCATTTT TTTTTTTGCA ATCTCTTTCA 
106901 CCAGCTGACC AATCTGCTAT GTAGTGAAAT TGCTTTATTG TTCTGTATGA 
106951 GACACGAAAA TATTTGTACA GAAGGGGATG TGTCAGGTGG AACCAAATAA 

107 0 01 AGGAGCACTG AAGAGGAAAT ACTAGAGAAA CAAATGTTAA AATAGGAAGA 
107 051 TGTTGATAGG ATGCACCTTG GGAAACTTTC TATTTTTTTG TAAAATAATA 
107101 GTCTTGATTA AAATGAACGA TGGAAAGAAG TTGCATTCTC AT C AC AGGC A 
107151 TTTTATTCTC TCCCTCTCTT TTCAGATTGA GAAGACAATT AACTTTGAAA 

X gene exon 7 

107201 AACTCACAGA GTGGACCAAT CCCAATACCA TGGAGAAGAG GAGAGT GAAA 
107251 GTGTACCTGC CCCAAATGAA GATTGAGGAA AAATATAACC TCACATCTGT 
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1073 01 CTTAATGGCA TTGGGAATGA CTGACCTGTT CATCCCTTCA GCCAATCTGA 
107351 CTGGCATTTC TTCAGCAGAG AGCTT GAAG A TATCCCAGGC TGTGCACGGG 
10 7401 GCCTTCATGG AACTCAGTGA AGATGGCATT GAGATGGCAG GCTCCACAGG 
107451 GGTGATAGAA GACAT CAAGC ATTTCCCTGA GTTAGAACAG TTTAGGGCTG 
107501 ACCACCCATT CCTCTTCCTG ATCAAACACA ACCCAACCAA CACCATTGTC 
107551 TACTTTGGCA GATATTGGTC CCCTTAAAGA GAGAAAGAGC TGGCAATAAC 
107601 ACATACCTTC CCCTCAGAAA CAAAATCCCC TTACCGTAGT ATTATAGCAT 
107651 AATCTTATCT CTTTCATAGA AAAGACATAC CCGCAGGAGA GGAGAC AG C A 

107 701 CGAAGCACAC TTACTCCTTC CCTTCTTGTA TTAATTT C AG AATGGCTTGA 
107751 TATGAGCAAA GACTGAGCCA ATGAGATGGT GAGAATGAAG ACACCTATCA 
10 78 01 GCCATTAAGG TGATAAGTGA TTTTCACCCA AGGAATAAAT AGTAAGAATG 
10 7 851 ACCCTAAGTC CTTGGGAGCC CGTTACATAG AAAGCAATAA GCTTTGCTCA 
107901 TCCCATTCCC TGGTAACATA CTGCTGACAA ACCCACGTTA CCATTCCTGA 
107951 AACATGGGCT TTGAGATCTC CAGTCTAGAG GGGATGTTTG TGGAAGAGTT 

108 001 TCTGGTGTGC AGATTATTGA TTTGTGATTA TGTCAATTTT ATTTTTCTTT 
108 051 ATTTGGTAAT TGGGCAATGG TATACATGTT CACTATCAGT GGAGTTGTCC 
108101 T CT C AC C ATA AGTCCTCTCA CCTAGTTCTG AATTTCTTGC AGAGGTTTTT 
108151 CAAAGTCCTG AAGAGTCTCC CTTCCATTCC AGAGAAGGGA AATAGATCCA 
1082 01 GTTTTGCATA GGTGCAGTTA TGCCTTTTCT CAGAGTG C AG ATTCAAAGCC 
10 8251 TGAAC CAT AG AGATCCAGAT GATTCTTATG ACCCAGAACT CAGTGAGATC 
10 83 01 CACTGGGCGA AAGAACTGTC TAAATTTTTG TTTAAAACTT TGGAAGACAC 
10 83 51 TTCAAATTTG AGAACACCTT TTTGGTGAAA AATCCTGAAA GTGTTGTAAA 
108401 ATACTTCTTC TAAGAAACAA ATTAGAATCC TATTTTTTTT CTGTCTTCTC 
108451 TTCCTACGTA TAGATTGTCA ACTGCAGATG TGGATCCTCT GGCTCAATAT 
10 8 501 TATATTCTGT TTTCATTCTA ATCACCCATT ATTGATTTAG ATACATACAG 
10 8551 TTGATTTTGT TTTGGTTAAC ATAATAAAAG AAAACCACAA ACAGTTTTCA 
108 601 TGTAAATTAT ATTAGCTTTC TGAAC C AC AC ACTCTTAAAA ATATCTTTAC 
10 8 651 ATTTTAACAA CTGTGAGTAA AACGTGATTT AGCAGAAAAA TGTATTCTTA 
10 8 701 GAATGAATAA AGCATGCAGA TATGAAGTTT TTCAGGCATT TATGACATTT 
10 8 751 TTAGGAGTAC CTGTTTTCAA GAAGAACTTC AC CAAAGAC C TACAACCAGA 
108 8 01 GTTTGATTCT TCTCTGTATT TCAGATGACA AGAAGT AC CG CAATAGAATA 
108 851 CAAAGTTATT CCTATCTATT TTTCTGTGCC ATTC CAAC AG GCATTAAAGA 

108 9 01 TGACCTGGCA ATTTTTTCTG GTAAATATTT CAAGGAACAA CTATTCTAAC 
10 8951 AGTTTTACCC TTTTATACAG AATCACAGAT T C AT AGGAGC TGGAAGAGAC 

109 001 CTCACGGGAT CATCTAGTCC AACCCCATAT GATTTCATCA TTTTATATGA 
109 051 TGAAATAATC TGGAATTCAT ATAACTTGAA AGGCATAAGA AAGGTTAAAT 
109101 AGATCAACAG CTACTAGGCA AAGCATTTCC CTATGCAGAG TCTTGAGAGG 
109151 AGGAACTCTG ATGTTAACAT CGCCTATTTC CACATTAGTG TTACCACTGC 
109201 AGTGTCAATG ATAAAAGGTG AT CTGTAGAG TAAAAACAGC TGGTGCTACA 
109251 GGTATGACAC CCACATTTTT TGTAGATTAT CAGGATACTC ACAATACAGA 
1093 01 CACAGCTGTT TTTCAATGGT AAAACCAAAC ATTTTACCAA GTATACTTTA 
10 9351 TTTTTTGCCT TTAGAAATGG AAGTAGTGAG AAGAACAGTT CCAAGGTAAG 
109401 AGAAAT AT C A GCATCTCAAG GTTTACCATC AGTAGTTTAT TCATCTTTCA 

CR1L 

10 9451 CATCTCTTAT GTC C ATAGAA TCATAGAATC ATAGAGGTTG GAAAAGAC CT 
109501 TAAAGATCAT CAAGT CCAAC CATATCCTAA CCATACTACC CCAACTTTAA 
109 551 CAACCCTCTG CTAAAT CAGA AGCAGTTGAT CCACTCTGCT AAATTTAAAA 

1096 01 GCCAGTTCAC TTAAACAATA AGAAAACTAG AGGAAGATTA CATTTGCAAG 
109 651 CTACTCTTCT ACGATTAATA GACTGGAAAG TG CATAAAGT ACAAAGATAT 

1097 01 GCCTCAATGA TCTGAAAAAC AGCTGAGGTC AAAATGGAGA AATGGGGAAA 
109 751 AAAATTGAAG GTTTCTTGCT TGCAAGCAAA TATAAAGCTT CCCCTTTCTC 
109 8 01 AAAAAGAAAA ACAAGAGACA GGAAAGAGTG GAAATT CAGC AATACTGAAC 
109 851 AAAAATTGCA ACAAAATACT GATGGCCAAG CCCTGGCCAC CACTGACCAG 
10 9 901 GCAGGGGGCA GAACATACAA AGGGCAGGAT AAAAGTGTTC TCCATGAAGG 



WO 2004/094640 



PCT/US2003/039244 



43/79 



109951 


GGGTGGCAGG 


CCTGGTGGGT 


GGGATGATGA 


ACAAAATAAA 


TACACTTTTA 


110001 


AAACCTTCCT 


GTGGCTCCAG 


GCATTTTTGC 


CCCAGCCTAC 


CAAAGGCTAT 


110051 


TAGCATTTTT 


ATTTTCTGGA 


GTATTAAGAC 


CCTGTTTCTT 


TGACAGACTA 


110101 


CTGTGCAGCA 


ACTGACAGAA 


GGTTTAGTGG 


GTAAGTCACA 


GGGGATAAAA 


110151 


TGTTCAGCAT 


AG AC C AAAG C 


AAAAACATAA 


TGTCATGATG 


GGTGATCAAC 


110201 


TGGCTAACAG 


GTTGGGCTCA 


AAGGATTACA 
CR1 GG 


GTTACTGGGG 


TTACATCAGG 


110251 


CTGGTAGGCA 


GTCACTAATG 


GGGTTCTGCA 


GGGCTCAATT 


TTAGGGCTAG 


110301 


TTCTCTTCAG 


TGTTTTCATC 


AATGACTTGG 


ATAAAGGACT 


TGAAGTCATA 


110351 


CTAAGCAAGT 


TCATGGATGA 


CACAAATTGG 


GAAGTGCCAT 


TGACTCCCTT 


110401 


GAGGGTAAAG 


AGGCCTTACA 


GAGAGATTCT 


GACCAAT C AG 


AGAGCTTGGC 


110451 


AAT CAT C AAC 


TACATAAAGT 


TTAACAAGAG 


C AGGTGT CAT 


ATTCTGCACC 


110501 


TGGGATGGGG 


CAGCCTTGGC 


TGTGTGTACA 


GACTGGAGGA 


CAAGAGG CTG 


110551 


AGAGCAGTCC 


TGCCCAAAGG 


GACCTGGGGG 


TTCTGGCTTA 


CAGCAAGCTG 


110601 


TAT C TG AG C C 


AGCAGTGTGC 


CCTGGCAGCT 


CCAAGGGCCA 


ACCGTACCCT 


110651 


GGGGTG C AC C 


AGGCCCAGCA 


CTGC CACTGG 


GTGAGAGGAG 


GGGCTGTCCC 


110701 


ACTGTGCTCT 


GTGCTATGCA 


GCTGCACCTC 


CAGCACTGCA 


TACAGGGTTG 


110751 


GCTGCCACAA 


CGTAAGAAGA 


ACACAAAACT 


ATTAGAGAGC 


AT CCAAAGAA 


110801 


GGGCTATGAA 


GATGGTGAAG 


GGTGTGGAGG 


GCAAGATGTG 


TGAGGAT CAG 


110851 


TTGAGGTCCC 


TGGGTTTGCT 


CAGCCCAGAG 


CAGAGGAGCT 


GAGGGGAGGC 


110901 


CTCATGACGG 


CTGCAGCTCC 


T C AC AAGGGG 


AGTGGAGGGA 


CAGTGCTGAG 


110951 


CTCTGCTCTC 


TGTGACAGCA 


TGGGGCTGTG 


TCAGGGGAGG 


GT CAGGTTAG 


111001 


GGGTTAGGAA 


GAGGGTGATG 


AGGCCCTGGA 


ACAGGCTCCC 


CAGGGCAGTG 


111051 


GGCATGGCCC 


CAAGCTGCCC 


GAGTT CAAGG 


AACATTTGGA 


AAATGCTCTC 


111101 


AGATGTAGGG 


CTTGGATTTT 


GGGTGGTGCT 


GTGTGGAGCC 


AGGACTTGGA 


111151 


CTTGATGATC 


CTTATGGGTC 


CCTTCCAACT 


CAGGATATTC 


TGTGATTCTA 


111201 


TGACAAGATG 


CACTACTGTT 


CTATGTGTGA 


GATACTACTG 


TTCTGTGTGA 


111251 


GATACTAGTA 


GCCAAGGCCT 


TCACAGGGCC 


TTTCTGAATG 


TGCCTCCAGT 


111301 


G AATGGT C AC 


CGGAGTAATC 


CCCTCTGTCA 


ACACTGAGAT 


ACACATCTCT 






Y: 


OV 1 HOMOLOGY 




111351 


GTCACCATCT 


GTGACAGGCT 


AAGGCAGCAG 


TGCAGGCAAC 


AATGTCAATC 


111401 


TCTT CAGAAT 


GGCACAGCAC 


TGCTGCAGAA 


AGGGGTCTGG 


TACGCTGTGA 


111451 


GCTTCTGTCT 


GAAAAAC CTT 


GAC CAAACAC 


TGGT ATT CTT 


TGGACTAAGG 


111501 


AAGCAACATA 


ATTCCATAGA 


ACAACTGAGT 


GGGAAAT C AC 


CACTGATAGC 


111551 


TATTGC AT CA 


AGTTCTGCAA 


CAGCAACTAA 


GAAATCACTG 


GCAATCACTG 


111601 


TGGG CAAGAC 


AAACACTAAA 


TGGT CAATAA 


GCTCCCTGCT 


AC T AGAAAAA 


111651 


CAGTGGAAAC 


ATAAATGAGA 


ACAAAATCTC 


TAGTTGTGCA 


AAGGTTTTTG 


111701 


TGGTAGAAGG 


AGAGACTTTG 


TTCTATGAGT 


TGAC CTGGAC 


TTCATATTCT 


111751 


TTTGGAAAGG 


ATCAGATGTC 


AAAGAGTCTG 


TTAGTTTAGG 


GACAGGCCCA 


111801 


CAGTGAAATA 


CCTGGTAAAG 


CAAATAGCAG 


CTAAGTCTTA 


GCTGACCTCT 


111851 


TTCATGGGAT 


AAGCATTGTC 


AAAAAAAGGA 


CATGTCACAA 


CAACACATCC 


111901 


TTTTGTGTTT 


GTGCAGCATT 


ACACAGCTGC 


G CTGGAC AG C 


GT AAT C CAT C 


111951 


AGCTATCCCA 


GAAAGCATTT 


CACATACAGG 


AAGGTTTGCT 


TAATTTTGCT 



112 001 AGACTTCACA A C AG AT C T AA ACTTGATAAG TAGACTACAC AAGAAGTCCA 
112 051 ATATTCTGAT GATTCTGCAC TGATGACTGA CCACGATCCA ATTTTTCACT 
112101 TACTGTGGAC TGATTTTTAA ATT CTGC AAC TCGCTTTAGA TTAACCATTC 
112151 ATTTGACAAA AAGAAAAAAT CCAACAGCAA CAGTTGGCTG GGATGTATAT 
112 2 01 TAACTTTTTC TGGAGGAATC TGCTGCGCCT TCTCTATCAC AAAACAAAAA 
112251 TATTCCTCTC AGCACTGAGT ATATTTAACG CAGAGATATT TTGAAAGCCA 
1123 01 TATAATTACT AACAACATTA GTGCTCTGAA TTAGCTATTA TGACACAACT 
1123 51 GTAGTAT CTT TGTAGATCCT GAGTTGTAGG CTGTCTATGA TGGCCCAAAC 
112 401 ATATGATTCA GGCAGATGGT ACACAAATGC CCAGGGAGCT CTCCTATAGC 
112451 AAGCTGTAGC ATGTTGCTAG ACAGTTTGAT AGTAAAAGGA TTTAAGACAT 
112 501 AAT CAAAAGG TAGAGGAGAC ACAGTACAAC TTGTTGGATA ATCCTTTGAC 
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112551 TTTTGAGCTG TCAAGTATAC 
112601 CCTGTTTGAA CAGAAAACAT 
112 651 GAAGGGGCAG CAGCCACAGA 
112 701 TTGTGTGATT AAAGATATTC 
112 751 ATGTCTTCCA CCAGCACAGA 
1128 01 TAATTTACAT ATTGTTGAAG 
112851 GTTTGATGTG ATGGGCTACC 
112 901 CAAAAGTATT GCTGCTGCAT 

112 951 ATAGATGGTA TCATGTAACA 

113 0 01 CTGAGGTCTT CATTATTAGT 
113 0 51 TAATGAGGAC TCTATTAGGC 
113101 ACTTTCACAG ACAACTAAAA 
113151 AAACATAATT TGTTCTAGAC 

1132 01 GGAGTTCTGT TTTAAAATCA 
113 251 TGTTGCCCAT TCTTTTATTT 

1133 01 C TATTGT ATA TTATGACTGT 
113351 GGCATGAAGT TACTTGTGAA 

1134 01 AAAGACACAT TTTGAACCTC 
113451 ACAACTCAAA TCAACATTCA 
113 5 01 GTAGCTTCTG ACTGGAGTCT 
113 551 ACAGAAGTCT CTTCTCTGCT 
113 601 CAGTTCTGTA TTTTTTTTTT 
113 651 TCAAAATTAA TTTGGATTCA 
113 701 ACAATCTGCA CAGACCATGT 

< 113 7 51 CCAGCAATGC TAGACTGCTC 
113 8 01 CTGTAAGTGG CATCAGGCAC 
113 851 CTGTAGGTAT CCCTATTATT 
113 901 GGGTCCCTTC CAAATCAAAT 

113 951 TTGCAGTAAG TACAGGGTGG 

114 0 01 TGCATTTTTT ATGTTCTCAG 
114 0 51 CAGTACTCAT AGAGGAAAAA 
114101 TTAGAGACAT TGTGTACAAC 
114151 TTTTCACTAT CTGTGCCACA 
1142 01 GTGTTCAAGT TTCAGTGAAT 

1142 51 AGTATACTCC ATCCTTTTCT 

1143 01 GAACTTTTAT TTAGAGACTT 

1143 51 TTCTTTAACT TTTTCCTTTC 

1144 01 TTTTATCTTT CTTTCCTTCT 

1144 51 CCTTTTTTCC TTCAATTTCT 

1145 01 CCATTTCTTT TCTTCCTTCC 

1145 51 TCTTTCTTCC TTTCTCTCTC 

1146 01 TTTTTTTTAA TTTTAATTTT 
114651 CCAAGTAAAA GTGTGTTTCT 
114 701 G CAAACAC AG AATAGGCAGC 

114751 TATTTCAAGG AGTTCTGCAA 
114 8 01 CTCTTATCAT CACAGGTGAA 
114 851 GATTCATGTT CTAACCCATT 

1149 01 AC TGATGGAA CTTTGTGGTT 

114 951 TTAAAGGATG ACTTCTTTAA 

1150 01 TGCATTACTG TACTGTTTCA 

115 051 GTACAAAGTA AAGCTATGTT 
115101 CTGTACTGAG ACACCCTTTC 
115151 TTTAGAAATT AAATGCAGCA 
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AACCACGGAC CATTGCAGTG AGTATTAAAG 
GCTGATTGCT AGCCTTAAGC AAGAAAGGGA 
AACATCTTGC AGTGTGAGGA GTGCTCTAAA 
ACCATGAACA GACACATTCA GTCACTTGAT 
TACCAAAATG GAACTCACGA CAGTGGTGAG 
CAAGAGAATA GCTCACTCCC TTTATAATAG 
AATAAGAGTT AAGGCCTAAT GATCTTTACT 
AGCAATGTCT G CAC CAGACT GGACTGGGCT 
TACTAGTTGT AATTAAGTGT AT CAGAC AGA 
ATTGCTCTAG CATCTTCAGC TGAACAAGAC 
AGAAAGGTAT GGACTATTCA GAGGCTGTTC 
GGGTTAAGGA GTCCACCTCT TTTCTCCAGA 
AATTT CAGAG GCATTTTGTA TATTGACTTT 
GAGCATACTC AGAGGT C AAA GTAGTTTGTT 
CAAAGGATTT ATGAGATTGC TTTATGCTTG 
CCTGCAGACC ATGAATGTTT CACCTGATGT 
CGATCTGTAA GAATGTTCTT TGAATGTGCA 
ACATCTGGTG CTGTGACCTG TTTGAAAAGA 
AAACTAGCAG TGAGTTCGAA TACTTCTCTT 
GAATATC CT A AT AT CTGAAT TTAAAAAGCA 
CAACCTCTTC TGCGACAGTA CATCTTTCTT 
CTTTTAATAC AGATGCTCTG AATATTGCTT 
TACAGTATGC TTGTTGATAC TTTCCTACTG 
TGGCACACAA GGTCCCTGAG TTAGACTGCT 
TGCAAAATGC TTTATTTTTT GCAATTCAGG 
AAGAACTAGA CAATTACATA CAAGTTTTCA 
TGCAGAGGAT TTGGAC TAGA TGGTCTTCAA 
GATGCTTTGA TTCTGTGATT TTATGAAAAG 
GCATAACACA GCAAGGAGTC CTGAATGTAC 
AATGGTGACT GCTAGAGGAA TCTGGACTGT 
AAAAAAAAAA AGGAAATTGA CTTAAATTCC 
TAAAT AT CAC ACTTTTTTTT TTTTGCTTTG 
GTATTTGGTT CTGTGCTTGA ATTATACTTA 
AGCTTTTATC ATTTTTGTTT CAATCTTATC 
CCAAGGTGCC AT ATGAT AT C CTTCCTTCTG 
CTTTCTTTCT TTCCCTCTTC CATTCTCTCT 
TCCTTTCTTT TTTGTTTTCT TTTTTTTCCC 
TTTTTCCTTA TTTCTTTTTC CTTCTTTCTT 
TTTTTCCTTC CATTTCTTTC TTTTTTCCTT 
TTCCTTCCTT CCTTTCTTTC TTTCTCTCTT 
TCTTTCTGTC TTTCTTTCTT TCCCTCTTTT 
TATTTTTTTT TTGTAAATAA AGGACTT CAA 
GACACTGAGT TCCATCCATC ATTCAGTTTG 
ATGGGGTGTG TCATGACATT ATACAGGATA 

Y EXON 1 
GGCTGT AC C A CGTACAGCTG AGAAGCTGTA 
GCTGATAAGG TAAGCATTTC TTTTGGTTAT 
TTTTAAAATG AT CATAAGAC TTACAAGAAT 
TGTCATCAAG . AACAGTCAAG AAACAAATGA 
AAATCTATTC TTACCTTCAC ATTTCTGTTC 
CAGCCTGCCA CATATGAAGT CAAAGTGTTA 
TACTAATTCT GTAACACTGA GAAGCTGGCA 
TTCTTTTTCA TTGATGCCCT TTGTTTCTGA 
CTGAATTTGT TTAAATTCAA GACTTAAGCT 
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1152 01 GAGTTGCATG GTCTACCTAA 
115 251 GCATGGTCAG GTATCAACAA 

1153 01 AAACAAGCAA ACGAACAAAC 
1153 51 TTGCTTCAAC AGTATGTTTT 
115401 TTAAAATATA ATACTCTGTG 
115451 CTATCACCTG CACATGTGAT 
1155 01 GTGTTGATTC CACACAATGA 
115551 TGTACTTGGG AAATTCTACC 
115 6 01 TGGGAAGATA TAAGGACCTT 
115651 TCTATGTTGT CTTCTTTACT 
115 7 01 TAAAATGATA TGTGTTGAAA 
115751 AAAAAAAGAC AGAAAAAATG 
1158 01 AATGAAACAA GCCTAAACAA 

115 851 AAAAAGGTGC CAATAAAATT 
115901 TAAGTCAACC AAGAATGGAA 
115951 AGCAGATGAG TTTGGTAAAT 

116 0 01 TTCCCTAATC TATTTTTAAG 
116051 AAAGTGGTTG GTATAAGAAT 
116101 AT C C AG AAAA AAACTTGCGG 
116151 TCAATAACTG TAGTTTGGAT 
1162 01 AAAGTGATGT TAAAAACTAT 
116251 AATGAAGACT GAGAACAAAC 
116301 GGAAACGATT GTCAATGAGC 
116351 CAACACAGTA CACATGTACA 

1164 01 TGCCAGCCTA CACATAACTG 
116451 ATCCAAGCAC ACACCTGCCT 

1165 01 TGAAAATAAG CTGTGCACTG 
116551 TGGAATACAA CTAATACATT 

116601 ATGGATTCCA TCAGTGTAAC 

116651 TGAGATGAAA GTCCATCATG 

116701 GCATCCTTAC AGCCCTGGCC 

116751 GAATCTCAGA TGAAGAAGGT 

116 8 01 TGGACTCTAC TTCTGCTTGT 

116 851 CTCATGAGGC TCTCACATAT 
116901 GAATTGTATT ATTTTTTTCT 
116951 ATTTTTTTTC CATATTTTTT 

117 001 GGAAAGGAAG AGGGAGAACA 
117051 TAACAT CC AG ATGCTCACTG 
117101 TTTCCAATTT CTCAGAAGGT 
117151 CTTGCAGCCT TTTAGTTCAT 
1172 01 TTGTTTCTGT TTTTGAGCCA 

1172 51 TTTAAATGCA TACCATTTTT 

117301 TTTTCTCTTG CAGGTTCTTC 

1173 51 CCACTGACTC TCAGGTAAAG 

1174 01 CTCCTCAGGA CAAAACTAGA 

1174 51 GACTGTCAAA AGTGGTGGCA 

1175 01 GAGGAAAAGA GTGTGTAACT 
117551 TGGAGATAGG TTTATTTGTC 
117601 GTGGGTGAAT TCCACCATGG 
117651 GGGGT CTGAT TTTCTAGATA 

117701 TTTCCTGTCA TCACCTCAGT 
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CATACTTTCT GAATGAAGTT ACTGAATGCA 
CATACTGCAA ATTAATTTCT GTGTATTCTA 
AAAAAACACA CACACACATG CACAAAGCAT 
TTCAACAAGA T C AT ACATGG AGCTTAAAGC 
GGAGTAGTAA ATAATCCAGA AGTTTGCCCT 
TCAATTAAGA GAGAGATGGA ACACATGAAT 
AACATTTGGC AGAATATCTT GGATTTCCCC 
CTAGGAAGAT TCTCTCTGCT TGTGACAAAA 
AATACTGCAC TTTACAGCAC TGTTGTCTAT 
AAAGAGTTTT TTTTTTCCTT TACTGTTAGA 
CTACAGGGAA AATTTCATTA GAATGT CAGA 
TTTAAATACT GACGATGTGA AGTATCTGCA 
TCACTGCCTT ATTAAAAGGT GGATTTTATG 
AAAGAACAAT TTTGAAAAGT GAGGTATAAT 
CATGTAATAT TTAACAGACA TTTGTCATAA 
CATTATCTCT TTCTATCACT GTGCTTCCAT 
AAGGTAATGA TGAGGTTTGA GACCT CTGAT 
CCAGCTTCCA TTTACATGAA GGTGGAGTAA 
TGTTTTTCCA GACCTACCCA CTTTATATTG 
CACAGAGGGC TGAT CTGTTA ACTGGTGTTA 
AGTGAAAAAC CTGGTCTGGA GTCTCAGGTG 
CTATGTGTGT TTTCTTTCCT GCACAAGATG 
TTCTTTCAAG GCAAGTCTTT GCAATATTTT 
GAGGATAACT CAAGTTTCAA ATAAAACAGT 
GTACCTATAC AAGATTTTGA TTGCTCACAA 
TTTAAATCCA CACTACTGAA TTCTACTTAC 
TGTACAGAGG TTCAAGTGCA CTGACTTCCT 
TTAATCTTTT CTTTAGACAA CGATTT CAGA 

Y EXON 2 

AAATGCAAAA TTTTGTTTTG ATGTTTTCAA 
TCAATGAGAA CATCTTGTAT TGCCCTCTGA 
ATGGTCTATC TGGGGGCAAG AGGTAACACT 

AAGTTGCTTA CATTGGTGTA AAGTGGACAG 
CATTCCTTCT AAGTAATAAC ATATTATCTA 
TTTAATTCAC CAGATGGATC ATGAATCAGG 
AAATTCTGAC ATCTTCCACA TAATGTGATC 
ATTTTTGTAT TAAAAAGATA AAACCCTGGA 
TTATTCGCAG TGCATAATAC ACAACTAAGT 
AAAAAAATAT AATCTAAGCA AATAGTGCTA 
GACATGAAGT ATGAACCAGC TGCAAGCTTA 
CTAATCTAGC ATTTGTTGTG GGTTTTTTTT 
ACAGCTCTAC CCCGAACATC A CGTGTAAAT 
GGTCACGCTT GTGTTTTTTT CTCACTGGCA 

Y EXON 3 

ATTTTGATAG CATTACAGGA GCTGGAAGCA 

ATGTAACTTC TCTCCTTTTG TTCCTATTTT 
ACTACTCTGC CTCTGCTCCA AGCAGTTTCA 
ATGCTCTCAA ACCAAACAGA TCTGTGGAGG 
CACTCTTGTT TAAAGCCAGG GAAACTG AC T 
TGTTTAATGC ACCATCATCA GACTAGGTCT 
CCTGACTGTT AGTGATGGGG ACAGTCCTTT 
AGGAGAAACT AATGTGACAT ATCATCTTGT 

Y EXON 4 

GTGGCTCTTC TGAATACGTC CACAATTTGT 
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117 751 TCAAGGAGTT ACTCT CAGAA ATCACCAGGC CAAATGCTAC ATACTCACTC 
117 8 01 GAGATTGCTG ACAAACTCTA TGTT GACAAA ACATTCTCAG TTCTTCCGGT 

CR1 GG 



117901 
117951 
118001 



GGACTCGGCT 


CTACTGTGAG 


CCACAATGGG 


AATTGGTTTG 


AGCCACAGGA 


TGAGTTCAAA 


CCTTTCTGTG 


GCTTTTAGGA 


GGAGGCTAGG 


CTCACACAAG 


GTATAAGGGC 


TCTGGAGATA 


TTCAAGACCC 


ATTTGGACAC 


TTTCCTGTGC 


AATCTATAAT 


GAACCCCTGC 


AGGGGGTTCA 


AATTGATGAT 


CTCCAGAGAT 



118101 CCCTTCCAGC CCCTGCGATT TTGTGACTCT GTA ATATATG CCCATGCAGC 

118151 AACTGCTACA GGGAGCAATC AAAATTGCTG CTCATTCACT AAAAAATGTC 

118 2 01 TCTTAATGAA AAAGGTG AT T TGTAAGGGAG GAAAATGACT TGAAAGCGTG 

118 251 ACGACTGAAA TTGACAAAAA TATTTTGTTC ATCTTTTCTA AAACTAGACA 

1183 01 TAAAATAACT CACTTAAAGA AAGTTTTGGT TTTGAAATAA AAAACAGGAA 

1183 51 TGTAAGAATA CACAGTTCAA AAGAAAAGGT AGGCACGAAG ATGAGGAAAT 

1184 01 GAGTATTGTC TGTCCTTAAT AATGTTTGCA GAACAGAAGG TTTTATGGTA 
118 4 51 AAATGAAGAA AATATTTCAA AATTTTAACT TAGAATC CAA T C TGAAGAC A 
118 5 01 AAAGTGACAA ATCTAAATAT GTGAAGTAGC CTTGTCCAGC TTTAAGATTC 
118 551 AGTTACAGCA AGAGAGCTGT TTGACTTGTT CAAGTGTAGG GATAGAAGTT 
118 6 01 TCTTTTAACC ATCACTTTCC ATTTCATTAA TTTTGCATTT CATATTCTTC 
118 651 TATTTTAAAG TTCTCAACAG TCAAACACAA TTCTTCTGCT TATAGGAATA 

Y EXON 5 

118 701 CTTAAGTTGT GCAAGGAAGT TCTATACAGG AGGAGTGGAA GAAGTTAACT 

118 751 TCAAAACAGC TGCAGAAGAA GCAAGGCAGC TCATAAACTC CTGGGTGGAA 

118 8 01 AAAGAGACAA ATGGTAAGAA GTAAAAAAAT AGCTGATATT TTCTCCTACC 

118 851 TACTGTAATC TACGCTCTTG TCTTCTTCTC CTCAAAATGT GAAGAAAGGC 

118901 ATATCAAGGA ACAGCACTTG ATTATTGCTA TGAAAGCAAA CTCCCATAAA 

118 951 ACTCACCATG CCCTTCATTG CAGG C ATT CA TGCAAACCAG ACAGGCTGTG 

Y EXON 6 

119 0 01 T CTTAAC ACT CACTGCTTTG CTTCCTTTTC ACAGGACAGA TCAAAGATTT 
119 051 GCTTGTATCA AGCTCCATTG ATTTTGGTAC AACAATGGTC TTTATTAACA 
119101 CCATTTACTT CAAAGGGATA TGGAAAATTG CATTTAATAC AGAAGACACT 
119151 CGGGAAATGC CCTTCAGCAT GACAAAGGTA GGGACATGGG CACTACTACT 

1192 01 GGAAAAATTC AAGATAAAGT GATCCCTACT CACATTGTCT CATGCTTCTG 

Y EXON 7 

119251 TTTTGCAGGA AGAAAGCAAA CCTGTGCAAA TGATGTGTAT G AACAAT AG C 

1193 01 TTTAATGTGG CCACACTGCC TGCAGAGAAA ATGAAGATCC TGGAGCTCCC 
119351 ATATGCCAGC GGAGATCTGA GCATGTTGGT GCTGTTGCCT GATGAGGTTT 

1194 01 CTGGCCTGGA GCGGGTACGG CCCTGGCAGG GGAAGCCAAC TAGTTCGGAG 
119451 TTCAGTGGGA ACTTCTGACT GCTTTCAGAC CTTTGGCTGT CCTCTCACCC 

1195 01 CCTGGCTGTG CTGTGCTGGC CAGG CAGGGG AGCACAACAG TGGCCCAGGT 
119551 GCTTCTAGGT G CT CAGG CAG AGGTTGGCCT CTAAGGAGAG CCCTAGCCTC 
119 6 01 AATGTTATTA AACAAAGAGT GTAGCTAACA AAACAAAGGT AAAGGAGCCT 
119 651 AGGGCTGCTG TAGTCCTGCA GCAGGGGATG TTGGTATATG CAAGTTATCT 

1197 01 CCATCAAGTA CTAGAGACAG ATATGCTAGC AGGATTTCTT TTTTTACTTT 
119751 GAAGAAATTT CAATTCCCAG AGATCAAGTA GAGTTCAAAC ACTGTTACCA 

1198 01 AGTCATAGGG ACCAATTCTG TTGATGACCG TTAATAGATT TTTTTCATGA 

AT -RICH 

119 851 GTCACCCCTC C AAATAATTA AATATAATTT TTTTTTTGTA AATATGAGGG 

119901 ATATTTTAAA TGATCATTTC TCATTGAATG TAGAAAAAAA TAGGAAAAAT 

119 951 ATAACAAGAA AACAAACAGC ATT TCTGAGA GGTTAGCTGC AAACATCTGC 

12 0 001 AAATGAG CAA AAATTTGATT TGACATAATC AAAAACTGAT TTTTCAGAAA 

12 0 051 AGCATTTGAT CTGTTGGAAG AATTTTCAGA TGACAAAGTT TTGGAGAGCT 

120101 TCATCAAGAC AGATGATATG TAGGCTATAG TCAGGAAGAA GCACAAGGGA 

120151 TAAACAAATA GATTTAAGCT TAAGCGTCAC TTCTGTTTTG CACACAAATA 
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12 02 01 AATGAAATAA ATAGCAACAA 
120251 TACTCCTGCT TTATGCATTT 

1203 01 ACAATTAACT TTGACAAACT 
1203 51 AAAGAAGAGC ATGAAAGTGT 
12 0401 ATAACCT CAC AT CT AT ATT A 
120451 CGTTCAGCCA AT CT GACTGG 
12 05 01 TGATGCTGTC CATGGGGTGT 
120551 CGACAGGTTC AACAGGGGCA 
1206 01 GAAGAGTTTA GGGCTGACCA 
12 0 651 AACCAATGCT ATTCTATTCT 
12 0 701 AAGAGCTGGA AATAATGCTT 
120 751 TGTAGTATTG TAGCATAATC 
120 801 CTTCAATATC TAGGGAGACA 
12 0 851 TTTATATGCA GGAATTTATT 
12 0901 GTGTACATAT TCCCAATGTG 
120 951 TTTGGGGTTT TTTTTGTTTG 
1210 01 TTTTTTCTGG TTGGTTGTTT 
121051 CCATTGTTAT TGAGAATCTG 
121101 ACAGCTTTCA GGAAAATCCT 
121151 AT C CT AG AAC TGATGCCTGG 

1212 01 CAGACCATTT GTCTTCCAGT 
121251 CATTTTTTCT GAGGGAGCAT 

1213 01 TCTCCAAACA AAACATCTTC 
121351 AGTGACATGT CACTGATTTT 

1214 01 TTCTAATAAA AGCTTCGCAA 
121451 CTGTACTACA CAAC CTGAAA 

1215 01 AGAGGTGACT AAGCTGGTAT 
121551 CCATATATGA GGTCACTAAA 

1216 01 TGGGTTTAAT CAAGGTAAAC 
121651 TCCCAATACA GCATACCAAA 
121701 CAGAGAACTT TTACAGAAAT 
121751 GTGATGTGTA CAAGATAGAA 
1218 01 AGACTGCTTC AGCTTT CTC A 
1218 51 CTATTGCCTC ATGTGATATT 
121901 TCTTTAGAGG AAAGGGTCTT 
121951 ACTAGTGTTA ATGACAGTTT 
1220 01 TTACAGTATT GGTACTGAAA 
122 051 CCTCAAACAC ATGCCATGTC 
122101 TAG CAGCAGT GAAACACGTT 
12 2151 TTACTGGTCT TCCCAAGAGG 

1222 01 GATTCAAGTA AAAAAAAGTA 
12 22 51 TGTGTACATC ATGTTATTTC 
12 23 01 TTTGCTACAA CTAAAACCAA 

1223 51 CAGGACTGGT AAAATATTGT 

1224 01 GCAGTTATAA TACTGTGTTG 

1224 51 ACTTTCTAGG TACAATAACA 

1225 01 TTTAATATAT AG CGTAAACT 
122551 TTTAAAGCAA AAGATGAAGA 

1226 01 AAAAACTATA CAAAAGAGGG 
122 651 TAACTGTGAA ATGGAAGCAT 

1227 01 CACTAAGGGT AAAAAGTTAC 
122 751 TCCAAAGGTG CCTCTGAGCT 

1228 01 ATCCCAAGTC CAGCTATTGT 
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GTGGTATTAA TACAGTTGGT ATGGCCACCA 
CATTGTCTCT TCTCTTTGCA GATTGAGAAG 
Y EXON 8 

CAGAGAGTGG ACTAGTACCA ATGCAATGGC 
ACCTGCCCCG CATGAAGATC GAGGAAAAAT 
ATGGCCTTGG GAATGACTGA CCTGTTCAGC 
CATCTCTTCA GTAGATAACC TGATGATATC 
TCATGGAAGT CAATGAAGAG GGCACTGAGG 
ATTGGAAACA TCAAGCATTC CCTTGAGTTA 
TCCATTCGTC TTCTTCATCA GATACAACCC 
TTGGTAGATA TTGGTCGCCC TAAAGAGAGA 
ACCTTCCCCT CAGAAATCAA ACCTCTTTAC 
TCAATGCAAT ATTTTATCCA AGTGGAAAGC 
TTCTTGAAGA AGCATGTGAA ATTTCAGATC 
CTCAGCTTAG ATT CAGGATT C ATAT CCAAG 
CTTGAATAAC TTGGGAAACA GGGCCAGTGC 
TTTGTTTTTT GTTTTTTTGG TTTGGTTTGT 
TTTTTTTTTG TGTGTGTGTG TGAGATTCTG 
GTTTCTCTAT AGGAGTT CT C TGAAATAAAC 
GGTCCTTTCC ATTGAATTAG CTGGGCAGTC 
ACAACTTGCA GATGAAATTT TTAACTTCAG 
AATCCATTTG GACTTATTCG TGCTG CGTAA 
ACAGAAAGTC TACCATTTCT TCTTAAATCA 
CTGATTGATA TTATTTCCCA TTTTCATCCC 
GTGAATGTTA ATT AATGGT C TTTCTATTTA 
ACAAAACATG TCATTACCTA TTCTGGGTTA 
AATACGATAT AGCGGGTAAT AATTATTGAC 
GTGGATCCTA TTTTCAAAAT CAGAATGTAC 
TATTTTAAGA TTAAAAAAAA AAAAAACAAC 
CCTATAGCTC CTACTCTTCA ATTGAGCTTC 
TAACAAAATT TTTTGAATTT ACTGAATTTC 
CCTCTAAGGG TCCTCAGTAA ATACATGAAG 
TTTTAAAATA TGAGAAAGGT ATTAAAAGGT 
TGCTGACAAG AAT CACATGA AGAAATCTTT 
CCTCTCGAGA TGTTGTATGC TATTTCACGG 
TAGGTTATAT ATTATCCAAT TATAATGGTT 
TCTGCTAGGA T AC AC ATG C A GAATTGCAAA 
ATGTAGCGAT ACTTCAGTAA TTCAGAGCTG 
AGCATTAACT ATAACTTGAA ATGATGACAT 
TCATACCCAC TAAAAATGGG AGAAATGCGA 
GTCAGAAGGA TTAGGTACAG TTTGCACAAG 
TTTCAAGAGC TATGAAATTT CAAGTTTTAC 
CCTCTTACAG CTGAAAT CAG TCAATACAGT 
CCAACCAAAC AAACGGCAAA GGAAAAAAGG 
CCTGAGCAGC AGAGTGTGTA AGGTTACTAA 
GAACTGAAAT ATGCAGCTAT GTCACTCTGC 
GTAAAAGTAC CCACATTTAC TATGGAGGTA 
AAAAAAACAG CTATTCAATT GTTTGATCTT 
AAATCAAGGC AGAAATGATA AATGAATTTC 
TATTCAAGGC ACAGAGTCCA CCTAATGTCT 
TTGACTTGTT TTAAAAAGGC CATTAGGTGT 
GTTTATCAGC TTTCAGAAAG AGGATGGCAT 
CTGAAGCCCA GCAAGGGATA AGAGAAATGA 
CCAAAAGTCC TTTTTGTTCC CTGATACCAT 
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122 851 GACTGAAGTT GTCATGTCAA 
122 901 CATGTTCTCT CAGTGCTGTG 

122 951 GCGGAATGAA AACAGCCAGG 

123 001 GAAGTGCTCT CTGGACTAAG 
123 051 AGGGACCATC ACAGAAT CAT 
123101 CTCTCCACAT TTGCCTATCG 
123151 GTTACAAGAC TCTGCATGTG 
123 2 01 AGAGAATATC CTGGTCCAGA 
123 251 AAAGCACAGC ATGTTGAACT 
123 3 01 AGTCTACAAA AT AT CAACTG 
123 3 51 CCTGTGGCTC ATGATCTAAG 
1234 01 ATTACTGTTG AGGAGCAGCA 
123 4 51 CTGAGGGCAA GGGAAGGAGA 
123 501 AGGGAT CAAG TGCTAAATTT 
123 551 CCAGAGAAAT ATGTAGTGAT 
123 6 01 CCAGAATGGC AACAGGTGAT 
123 651 TAAAACAGCA TGGAT C CAT A 
123 701 ACAAACAGTA TTGCTTACTT 
123 751 ACAATG C C AC TGAACCTTTG 
123 8 01 CTAAAGCTCT GTCCAAAACT 
123 851 GATGTGAAAT TATCTGTGCT 
123 9 01 GTAAACTCTA TTTTAAGAAA 

123 951 TGGTATCCAA AGGGACTCCT 

124 0 01 GTTCTTGGCC TGGTGCAGGC 
124 051 CAAGATACTC AGTTTAATGC 
124101 ACATGATTTT AAAAGGAAAG 
124151 CAAATAAGAA AGATACAGAA 

1242 01 GGAAATAATG TTAGAACTCT 
12 4 2 51 CAAGCCTCAC TGGAGGAGCT 

1243 01 GAGTCTGTAC TGGAAATTGG 

1243 51 GTGAGATGAA GCCCTGAACT 

1244 01 TGGATGAAAA CAACTCACTG 
124451 ACATGGCATC TCTGCTCAAA 
124501 GACCTTAAAG GT CAT CGAGT 
124551 CTAACAACCC TCCTCTAAAT 
124 6 01 TTTAAACACA TCCAGGGATG 
124 651 TCCAGTGCTT AACAACACTT 
124 7 01 ATAAACTTAC CCTGGCACAA 
124 751 CAGCACTGCA ATGT AG CAAA 
124 8 01 GACAGAGGAC ATGCAAATAG 

124 851 AGGGGACATG TGAGAGGTAT 
124901 GAAAGGAGAT CTTGGGAGCA 
124951 CTCCATGCTA GAGGATGTAC 

125 001 C ACAGGT C AG CGTCATCCTG 
125 051 TGGAGTAGGG ACAGCCCCGA 
125101 AGAGAAATCT AGTTAGCAGC 
125151 ATTACATACT GACCCTAGCC 
12 52 01 ACTGGGAAGA AGAGAGTGCA 
12 5251 GGAGAACTGA AGCTGACTAG 
1253 01 AT TCC AT AGT TTGCTGGTTT 
125351 TTATGTTAAT TAGCAAAAAA 
12 5401 C C TACAAC AG AAGCTCAGAT 
125451 TAGGGAATGC TGTATGTGTC 
1255 01 CAGACCCAAA CTTTTGCTGA 
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GACATTGCCT TCTGTCTGTT CACTACACCT 
TTCTTAGAGA GGCAGTACTG CTAGTGGTCC 
TGTAAT CACA CTCTTTTGAA TGCCTCATGA 
TGTAACTTTC CTTCTCACAT CATTTGGAGA 
TTGGAGACCA CCTGCATCTA TTCTGCACTT 
TCTTCTAAGC AAACCGAATC TATGG C TGAA 
GTGT C ACAGT CACCAGAGGC AGAGGACTCA 
GCTACTCTGG GATCTCAAAA GTCACTTGGG 
AGACTTTGGT GTATTTTTCC AATCTTTATC 
GACATGGAGC AGTGGTTCAC TTTGGTGTTT 
ATTCTTCAGT CTGGAAAATA AACTGCAGAG 
AGTGTCAGTC TGCTCAGCAA GGGAAGGAGA 
CCGTGCCAGT AAGGCTAGAA GGGCCCGATA 
TCCAATATTA TAGCCATAGT TTAGTGATTT 
GTGGTAGGCC AAAG AT CAAA CCAGAAGGCC 
GATCCCTGTG GCACATTCTC TATCTTGAAG 
TAAATACATT CTTGCTCAAC AGCAGAAATA 
CTACGAATAT CCTAACAAAA CATGTAGATC 
TATGGATGGA ATCTGTGCAA TCTGCCATGA 
GCACAACTTA GGGTGCCCAG CTTCTGAAGG 
ATCTCCTTTT CCCTTCTTGT GTTAGCTCCA 
TACCTTACAG TTTCTGATTG TCTTCTTTAC 
ATGCATTACA GGGTCCTCCA GCACAGTGAG 
ATGCAAAGTA GCTTAGGCAC GGGTCACAAT 
TTCTCCCAAG TGATGGGATG CTAAAATCTT 
TGTTCAAACT GTGGAAGAGA AATCCACTGA 
AATAAAGTTA GCTATAGAAG ACATGGAACA 
GAGGGCAAGA GTAAGCCTTA ACAGTAATGA 
CTTCCACATA CGTTGTTCTC ATGGGCCCAG 
CACACAGTTT GGGTAC CGGG GGCGATCTTT 
GCCCTGGGTC AGCTGCAGGT GTCTCTGTAA 
TGCACCAGAT TTTCAGCTAA TAAGAAAAGC 
CAGAATCATA GAATTGCTCA GGTTGGAAAA 
CCAACTGCAA CCTAACCAAC TACCCTAACT 
CATGTCCCTG AGC AC CAC AT CCAAACAGTT 
GTGAATCAAT CACATCCCTG GGGAGCCTAT 
TCTGTAAAGA AGTTTTTCCT GATATCCAAC 
CTTAAGGCCA AATTTAATTA GAAAATGTAG 
TGTAATTACG AAAAGGTGGT AG CTG CTAGG 
ACCCAAAAGA TAAAGACTAG AAACAGAAAA 
GTTTGGAGAA ACATAACAGA GGAGATATTT 
CAGGCAAAGA CACAATCCTG GGAGGAGGTG 
CTCTAAGGCA CCGCAGCCAT GGGCAACCAA 
GTGAGACTGT ATCCCACAAG CAGCTAACAC 
AGAACTGCAG CCCAGGCAGC ACACTAGAGC 
AACCACTGGC AGACAGAAAT GATTATATAG 
TCTTACACTG CCTACTGCAT CACTGAAAGG 
ATAACGTAGC TGAAACTAGG AGGAAGGCAA 
GGAAAAGGGG GATTAAAGGT TTAAGTGTCT 
GTTTTTTGTC AATTCCTGAA TCAGTAATTT 
TTACAAACAC TCCCCAAGTC AGGACTGTTA 
CAGCTGAGCC TTAGTCTTTT GGTCCCTCCC 
TCTCTCTCCA GGCCTGCTCA AAATTGACCT 
ATCTCCAGTA CCACCTCTTT TGCTCCTAAC 
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125551 TAGATAACAA AGCCCTGAGC GCTTTGCTTT TAGCAAAGCC TTTAAGTGCC 

12 56 01 ATTACCAACT GCACCTGGAG CCTTTACCTA CCCCTATGGA CCCAGGCTCT 

125 651 ATATTTAAGC TCTGCCCTGA ACCTTCACTT CTTTCCTGTC CTAAGTTAGA 

12 5 701 TGTACTAGTA TGGTGTGTAC TATGTCTCCA GTT CAAAC AC AGCTGTGCCC 

125 751 ATACCTGGCC AAGGACTCCT AGTATGACCT GGGCTGTGCC TTGCTGCTAA 

125 8 01 GGACCTGCTG GGTGATTGCT GGACCTGATC CTAATCCTGA ATTAAGAAAT 

12 5 851 GATTTCTTGG CTTGACTGGA TGTGCCCTGT GGTATGATAC TGCCTTATGA 

1259 01 TTTGGACTCT TGTTTGCAGC TGTGCAAATC CCTAAGGAGC CCAGTCTCTG 

125 951 GCCACCTGGA ATCTTGTCAC TACCAAACTT CCTGAGGGAC TGGTCTTGCT 
12 6 0 01 CTGGGTTCTG ATCTCTGGAC AGTACTCACC CTTTACTCAG CCCAGGCTCC 
12 6 051 CAGTTAAGCC CCTTTCCACC CTGCCAGGCT CTCCGCTCCA TCCCTAGCAG 
12 6101 GGGCTCTCAT GACAGTGTGA CCCCCCCTTA CTCAGGTCAG GGCCACTTGT 
126151 GCCACGTTCC TTTCCTGTCT TCTGTCCCTG CCTTGGCTCT AAAGCAGTGT 
12 62 01 GCTACCATCC ACAACCACTG CATCTCTCTA AAGTAAGCCT CTCCTGAGCC 
12 62 51 CAAGTCTCTG TAACGAGGAA GGATGCACTT TGCTCAGAAG GATGCGAGGC 
1263 01 TGCTTCTGAG CTCTGAGGGC ACTGACCTCC CATGAGGTAC ACCCCATACC 
12 63 51 CAGGACCACA ATTCAGCCTG CTGGAACCAT CAACTCCTGC TGGAGTAAGG 
12 6401 CCATAGCAAG ACCAGCATCC ACCTCCCTGC AGCCCTGCCC TGCCCAGATA 
12 64 51 TTGGGCCTGC TGATCTCAGG ATGCAGACTT GCTTCTCAGC TTGACCTAAG 
12 65 01 CATTGCCCTG TCTTTATGGA CCCACCTGGT TAGCAAGTTC AGTGCAGAAG 
12 6551 GAGGCTGTTG GCAT CTAGCT AATTTTCCAC CCACATTACT GTCTGCTGAC 

126 6 01 TCATTCTACG TCTCTCCCAT CTTGTTACAA TAATAATTTG GG AGAT CAT A 
12 6651 TTGAAGGTCT TAATAAAGTC AAGGCATGTG ATATTCTCTG CTTTGCCTTT 
126 7 01 GTTTCTAGAA TAAGCCACTT CAT C ATAG AA GATGAAAATG CTGATCAGCA 
126 7 51 GAGAT CTGTG CTTGATAAAT CCATGCTGGC TTTTCCTATC ACCTTATATT 
12 6 8 01 C CTT CAT ATG CCTTGAGACA CCCAAGGAGG CCTTGGATCA GAGCTGTCTG 

126 851 TAGCAGTCCT AACTGGTATA CAATTAGTTG TACAACAGGT AGTGATCCGC 
1269 01 ATAATAGTTG GCGTGAGAAA GTGGGCCTGT GCTGTGTCAA GCATAGAGTT 
126951 TGGGTTCCAG TCCTGTTCTG CATGGCACAT ATGCCTGAGC AGCTGGGTAA 
12 70 01 TCTCTGCATT CCAATTGGAA GGCAGGGGCC TGTAGGCAGT TCCCACTTGG 
12 7 051 CATGGGTGAT TGTACCACCT GTGT C CT CAT CTGTGAAGCA TCATGTTTTC 
12 7101 ATTCAAATAT CCTTTTGTTT GACAGTAGAA ATGAACAGAA TTGTTTTTTT 
127151 TTCCTAAGCA AATTCTGCAA GAGCTCTGAA GAACAAGGTG TCAGTGAACT 
12 7201 TCTAGCTCCA TAGATAGGAC TTGCATCACA TGTCATGCCT TGATTGGAGG 
12 72 51 TCTATCCGAT ACTGAACAAC TTGTGGTTCC CTGAGGGAAT GTAAGATTAC 
1273 01 TGATACTACT CTCTCTTTAT GTTAGCTACA ATAAATGGTA GGTTAAGCAA 
12 73 51 TAGATACAGA GTTTGAGTGC CTTTCTTACA AG CAT CAT AG TGAACAAATC 
12 7401 CACTGGTGAT CTACCTTTTC AATAACTACA GAGAATTGTA ATCTCTTGGA 
12 7451 TTCTCCTCCT TCCCCGTTCT GAAAATGTGT TCTTTTTTTC CAAATCAGAA 
12 75 01 ACCTTCCTCA ACCACCCTGA CTATT CTTTG GACATTGTTT TGTTCTTGCT 
127551 CCTAAATAGG CTTTATAATT TTTGTAAGTG AAAGGCTTTG CATGCAGGTG 
12 7601 AGGCTACAAC TCATTCAGTA ACAATGAGGA AGACTGT C AG ATTTTGGGGA 
12 7651 AAATTCTCCC ACCCAACCTT TTGCTAGCCA GTAAGATGTA ATCACTGAAT 
12 7701 GTCATGCCAC AAAGACCATA C CAACATC AG AC C ACAT AT C TACAGGAAGC 
12 7751 TTTAAGGAAT CATTGACTGT ACAGTGAAGG GTAAATCAAA TTAAAATGAA 

127 8 01 TGTGAGGTCT GATACGAGAT ATCCTCATGG GAATCAAGAG CAAAGACAAA 

Y:OV-l HOMOLOGY HS-III SITE 

127851 TAGTTTTTCA CAGTCTTGTC ATGATCTGTC ACAGACCAAG GCAGCACAGC 

12 79 01 AGGCAACAAT GTTGGTCTCT TCAGAATGGC ACAGCACCGC TGCAGAAAAA 

12 79 51 TGCCAGGTGG ACT AT G AACT CACATCCAAA GGAGCTTGAC CTGATACCTG 

12 8 0 01 ATTTTCTTCA AACAGGGGAA ACAACACAAT CCCACAAAAC AGCTCAGAGA 

12 8 051 GAAACCATCA CTGATGGCTA CAGCACCAAG GTATGCAATG GCAATCCATT 

12 8101 CGACATTCAT CTGTGACCTG AGCAAAATGA TTTATCTCTC CAT GAAT GGT 

12 8151 TGCTTCTTTC CCTCATGAAA AGGCAATTTC CACACTCACA ATATGCAACA 
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12 82 01 AAGACAAACA GAGAACAATT 
12 8 251 GTGGCAAAGA GGAGAACAAA 
1283 01 GGATAAGAGG CTTTGACCTG 
12 83 51 ATAAAAAGTG CTTTTATAAC 
12 84 01 CTGTTGGTTT AGGGACAGAC 
128451 CAGCAGAGCC TTAGCTGACC 
1285 01 TGTGACAAAA CTATTTGTAC 
12 8551 TCCATTGCCA CCTATCCCAG 
128 6 01 TTACTCTCTC TAGACCCCCA 
128 651 ACACTATGAT GACAGCCTGT 
12 8 7 01 ATTTTTGTTG CTTGCAGCTG 
12 8 751 CTTCTCATCT GACAGAAAAG 

128 8 01 CATTGTTTTA AACCACCGGC 
12 8 851 AAGACTCACA AAACAAAGCT 
12 8 9 01 GTAGGTCTGG AAT CAGGAC A 
128951 AAAGCCTTTG GGATCATTCT 
12 9 0 01 AGTGATAACT GACATAACTG 
129051 AAAGTACAAG ACATAGAGAT 
129101 TGGACAGGTC CGTGGTCCAT 
129151 TTGAGACTGT CTAAGAAGGG 
129201 TGAATCAGAT GTACTGTCCA 

1292 51 TTGGAGAAAT ATAAAGAAAT 

1293 01 AAAGGGATAT GGAGTTAAAA 
1293 51 TACCATGGGC TGCAGAGGGT 
129401 CTGCAGGGAA CTTGTGTTCT 
129451 ACTGACCTCA GTGT CAT C AG 

1295 01 TCTCCCAACT ACCATTGTAC 
12 95 51 AGGTACATCT AGCATCGATC 

1296 01 CTTTTGAGGA CACGGGACAG 

129 651 GGCAGACCTC CACTACCACA 
129 7 01 GAGCTACAGA AATGAAATGG 
129751 GAAACGTGGC GTTTAGCTCT 
129 8 01 GAAGCAGCCT GTTGTTATTC 
129 851 TTAATAAAAG CATTTCTTCA 

12 99 01 ATATTTTTTA TTAATTAATT 
129951 CTTTGGAAGA ATCTTGTCCT 

13 0 0 01 AAATGGCTGA CAATTTTGGC 
13 0 051 CATTAGGACA GAAGTT ATT C 
13 0101 CCTTTAAGTA TTTGGAAGTG 
13 0151 CCTCCCTGTC TGGTAAGCAG 
13 02 01 TCTCTTTTCC TTGTTATCTC 
13 02 51 TGAACTATGC CAACCTGCAA 
13 03 01 CTATACACAA ATTATTAGTG 
13 03 51 AT CTGAGAAA ACCTATTCCC 
13 04 01 GCATTTCATA GGTAGAGATA 
13 0451 AAAAAGCAGG CAAGATTTTC 
13 05 01 AAGACGTGAT TAAAAACAAA 
13 0551 GGTGTAGACA TCCAGCAAAA 
13 0601 AGTCCTCAGA CTTGGCAAGG 
13 0651 TCACAAAAGG AAGGAGAGAA 
13 0 7 01 GCTAGTGGTA TAGGAAAGTA 
13 0 751 TATGTATGTC CTGAAGAATT 
13 08 01 CAAACAGTGC TTTACAGAGG 
13 0 851 CTATTATTTC AATACAGAAC 



50/79 

AATGTGCTCC TTCCTAATGT TAAAATTGTA 
ATCTCAAGTT CTGAGTAGGT TTTAGTGATT 
TGAGCTCACC TGGACTTCAT ATCCTTTTGG 
TTTCAGGTCT CCGAGTCTTT ATTCATGAGA 
CCACAATGAA ATGCCTGGCA TAGGAAAGGG 
TTTTCTTGGG ACAAGCATTG TCAAACAATG 
TGCTTTGCAC AGCTGTGCTG GGCAGGGCAA 
GTAACCTTCC AACTGCAAGA AGATTGTTGC 
AGTCAAACCA ACTATGCAGG TATGCTGACA 
TCTGATCAAG ATCTCATTTG TT CATGGAC A 
GTCTTCCATT GGGAAAGAGT GTAGTATATC 
CAGAAATTCT CATGCTCCAC ACTTAATCTA 
TACTTCTTGG AGAGGAAAAA TGGCTTTTAT 
CTGCAAGTCA AATG CAT AC A AAACTGTTCT 
CTATGTGGAA GTCAAATAGA GCAGCTTTAA 
CATCTTATAT TTGCAGCACG ATACTATGAC 
CAT C AATTT C CTTGATATTT TATTTGTCTT 
GGACGTAAAG ATGGACATAT GACTCAGGTC 
GTATGATAAA AGAGATGAAG GGAAGGAGAA 
CTTCAGGGAC GTTCTGAAGG CAGATTTGAC 
AGT CT CAT AT GTAGCAATGG AAGGCTGATA 
GGCTGTGAAC TCAAAGTGAC CCTGAACAGA 
TAATGTCACA GAACTGAGGT TTATATGATA 
CAGAGTGCTC CACCATGGGC CTCTCTTGGG 
ACACCTGGAA CACCTCCTGC CCTCCTCCGC 
GGCTGTTTCT CTCACATTTT CTCACTCACC 
AGCAGTTGTT CTTACATATT GCTCCTCCTG 
ACTGGCTCAG CTCTGGCCAG TGGCAGCTCC 
CTGCTGGGCT CTGTTCACAG AGGCCACTCC 
ACTTGTAGTG TAAATCCACT ACAACTTTCT 
AGACCCTCTC TGCTATGGGA TACAAAAGAG 
GGCTCACTGG TACACCCAAC CACAGGGTGA 
ACTACTCTTA GGACAGATTA TGGTGAATTG 
TAACATCCAA AGGAGGAAAT ACACTAAATT 
ACACATGCTT AATTATATAT GGCATGGTTG 
TACTGACCAG ATCTGCTGTT TGCTGAGACA 
CATGGTGGAT ACCTTCCCCC TTTTCTGTAG 
TGGAGCCTGT CTGACAAGTT AGACTTGATA 
TGCTTTTCAT GCTGGATGTC ATCTCCAGAA 
TTCCCTGCCT TAGTAAGAGC CGAAACGGTC 
ACCAGGATAT TACAATGTGA CAGGACTATC 
ATTCCAAATA TATATATATA TATAAGATAT 
TTTGATTGAC ACCAGATGAC AGAGAAGTGC 
AATCTCCTTT CTCTTTCTGC AGACTGACAT 
ACATTTACTG GGAAGCACAT CTATCATCAC 
AGACTTTCTT AGTGGCTGAA ATAGAAGCAA 
ATGAAACAAA AAAAATCAGT TGATACCTGT 
AAATATTATT TGCACTACCA TCTTGTCTTA 
AGAATGTAGA TTTCCACAGT AT AT ATGTT T 
ACAAAAGAAA ATGGCACTGA CTAAACTTCA 
ATTCTGCTTA AC AGAG AT TG CAGTGATCTC 
ATGTTGTACT TTTTTCCCCC ATTTTTAAAT 
TCAGAATGGT TTCTTTACTG TTTGTCAATT 
AATAGCTTCT ATAA CTGAAA TATATTTGCT 
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13 0 901 ATTGTATATT ATGATTGTCC CT CGAAC CAT GAACACTCCT CCAGCTGAAT 
13 0 951 TTCACAATTC CTCTGTCATC TGCCAGGCCA TTAAGTTATT CATGGAAGAT 
1310 01 CTTTGAGGAA CACTGCAAGT T CATATCAT A AACACATTTG AAATTGAGTA 
131051 TTGTTTTGCA TTGTATGGAG CTATGTTTTG CTGTATCCTC AGAATAAAAG 
131101 TTTGTTATAA AGCATTCACA CCCATAAAAA GATAGATTTA AATATTCCAA 
131151 CTATAGGAAA GAAAGTGTGT CTGCTCTTCA CTCTAGTCTC AGTTGGCTCC 
1312 01 TTCACATGCA CGCTTCTTTA TTTCTCCTAT TTTGTCAAGA AAATAATAGG 

1312 51 TCAAGTCTTG TTCTCATTTA TGTCCTGTCT AGCGTGGCTC AGATGCACAT 

1313 01 TGTACATACA AGAAGGATCA AATGAAACAG ACTTCTGGTC TGTTACTACA 

1313 51 AC CAT AGT AA TAAGCACACT AACTAATAAT TGCTAATTAT GTTTTCCATC 

NRE: A,B,C regions 

1314 01 TCCAAGGTTC CCACATTTTT CTGTTTTCTT AAAGATCCCA TTATCTGGTT 

silencer* (common site) 

1314 51 GTAACTGAAG CTCAATGGAA CATGAGCAAT A TTTCCCAGT C TTCTCTCCC 

1315 01 AT C CAACAGT C CTGA TGGA T TAGCAGAACA GGCAGAAAAC ACATTGTTAC 

1315 51 CC AGAATTAA AAACTAATAT TTGCTCTCCA TTCAATCCA& AATGGACCTA 

1316 01 TTGAAACTAA AATCTAACCC AATCCCATTA AATGATTTCT A TG GTGTCAA 

1316 51 AGGTCAAACT TCTGAAGGGA ACCTGTGGGT GGGTCACAAT TCAGACTATA 

Ovalbumin exon L 

1317 01 TATTCCCCAG GGCTCAGCCA GTGTCTGTAC ATACAGCTAG AAAGCTG-TAT 

1317 51 TGCCTTTAGC AGTCAAGCTC GAAAGGTAAG CAACTCTCTG GAATTAC CTT 

1318 01 CTCTCTATAT TAGCTCTTAC TTGCACCTAA ACTTTAAAAA ATTAACAATT 

1318 51 ATTGTGCTAT GTGTTGTATC TTTAAGGGTG AAGTACCTGC GTGATACCCC 

1319 01 CTATAAAAAC TTCTCACCTG TGTATGCATT CTGCACTATT TTATTATGTG 
131951 TAAAAGCTTT GTGTTTGTTT TCAGGAGGCT TATTCTTTGT GCTTAAAATA 
132 0 01 TGTTTTTAAT TT CAGAAC AT CTTATCCTGT CGTTCACTAT CTGATATGCT 
13 2 051 TTGCAGTTTG CTTGATTAAC TTCTAGCCCT ACAGAGTGCA CAGAGAGCAA 
13 2101 AATCATGGTG TTCAGTGAAT TCTGGGGAGT TATTTTAATG TGAAAATTCT 
132151 CTAGAAGTTT AATTCCTGCA AAGTGCAGCT G CTGAT CACT ACACAAGATA 
13 22 01 AAAATGTGGG GGGTGCATAA ACGTATATTC TTACAATAAT AGATACATGT 
13 22 51 GAACTTATAT ACAGAAAAGA AAATGAGAAA AATGTGTGTG TGTATACTCA 
13 23 01 CACACGTGGT CAGTAAAAAC TTTTGAGGGG TTTAATACAG AAAATCCAAT 
13 23 51 CCTGAGGCCC CAGCACTCAG TACG CAT AT A AAGGGCTGGG CTCTGAAGGA 
1324 01 CTTCTGACTT T CACAGATT A TATAAATCTC AGGAAAGCAA CTAGATTCAT 
132451 GCTGGCTCCA AAAGCTGTGC TTTATATAAG CACACTGGCT ATACAATAGT 
13 2 5 01 TGTACAGTTC AGCTCTTTAT AATAGAAACA GACAGAACAA GTATAAATCT 
13 2551 TCTATTGGTC TATGTCATGA ACAAGAATT C ATTCAGTGGC TCTGTTTTAT 
132 6 01 AGTAAACATT GCTATTTTAT CATGTCTGCA TTTCTCTTCT GTCTGAATGT 
13 2 6 51 CACCACTAAA ATTTAACTCC ACAGAAAGTT TATACTACAG TACACATGCA 
13 2 7 01 TATCTTTGAG CAAAGCAAAC CATAC CTGAA AGTGCAATAG AG C AG AATAT 
13 2 7 51 GAATTACATG CGTGTCTTTC TCCTAGACTA CATGACCCCA TATAAATTAC 
13 2 8 01 ATTCCTTATC TATTCTGCCA TCACCAAAAC AAAGGTAAAA ATACTTTTGA 
13 2 8 51 AGATCTACTC ATAG CAAGT A GTGTG CAAC A AACAGATATT TCTCTACATT 
13 29 01 TATTTTTAGG GAATAAAAAT AAGAAATAAA ATAGTCAGCA AGCCTCTGCT 
13 2951 TTCTCATATA TCTGTCCAAA CCTAAAGTTT ACTGAAATTT GCTCTTTGAA 
13 3 0 01 TTTCCAGTTT TGCAAGCCTA TCAGATTGTG TTTTAATCAG AGGTACTGAA 
13 3 0 51 AAGTAT CAAT GAATTCTAGC TTTCACTGAA CAAAAATATG TAGAGGCAAC 
13 3101 TGGCTTCTGG GACAGTTTGC TACCCAAAAG ACAACTGAAT GCAAATACAT 
13 3151 AAATAGATTT ATGAATATGG TTTTGAACAT GCACATGAGA GGTGGATATA 
13 32 01 GCAACAGACA CATTACCACA GAATTACTTT AAAACTACTT GTTAACATTT 
13 3 2 51 AATTGCCTAA AAACTGCTCG TAATTTACTG TTGTAGCCTA CCATAGAGTA 
13 33 01 CCCTGCATGG TACTATGTAC AGCATTCCAT CCTTACATTT TCACTGTTCT 

Ovalbumin exon 1 
13 3 3 51 GCTGTTTGCT CTAGACAACT CAGAGTTCAC CATGGGCTCC ATCGGTGCAG 
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1334 01 CAAGCATGGA ATTTTGTTTT GATGTATT.CA AGGAGCTCAA AGTCCACCAT 
13 3451 GCCAATGAGA ACATCTTCTA CTGCCCCATT GCCATCATGT CAGCTCTAGC 
13 3501 CATGGTATAC CTGGGTGCAA AAGACAGCAC CAGGACACAA ATAAATAAGG 

13 3 551 TGAGCCTACA GTTAAAGATT AAAACCTTTG CCCTGCTCAA TGGAGCCACA 
133 6 01 GCACTTAATT GTATGATAAT GTCCCTTGGA AACTGCATAG CTCAGAGGCT 
133 651 GAAAATCTGA AACCAGAGTT AT CTAAAAGT GTGGCCACCT CCAACTCCCA 
13 3 7 01 GAGTGTTACC CAAATGCACT AGCTAGAAAT CTTGAAACTG GATTGCATAA 
13 3 751 CTTCTTTTTG TCATAAC CAT TATTTCAGCT ACTATTATTT TCAATTACAG 

Ovalbumin exon 2 
133 8 01 GTTGTTCACT TTGATAAACT TCCAGGATTC GGAGACAGTA TTGAAGCTCA 
133 851 GGTACAGAAA TAATTT C AC C TCCTTCTCTA TGTCCCTTTC CTCTGAGAAG 

133 9 01 CAAAATACAG CAGATGAAGC AATCTCTTAA CTGTTCCAAG CCCTCTCTGA 
13 3 951 TGAGCAGCTA GTGCTCTGCA TCCAGCAGTT GGGAGAACAC TGTTCATAAG 

134 0 01 AACAGAGAAA AAGAAGGAAG TAACAGGGGA TTCAGAACAA ACAGAAGATA 
134 051 AAACTCAGGA CAAAAAT AC C GTGTGAATGA GGAAACTTGT GGATATTTGT 
134101 ACGCTTAAGC AAGAC AG CT A GATGATTCTG GATAAATGGG TCTGGTTGGA 
134151 AAAGAAGGAA AGCCTGGCTG ATCTGCTGGA GCTAGATTAT TGCAGCAGGT 

1342 01 AGGCAGGAGT TCCCTAGAGA AAAGTATGAG GGAATTACAG AAGAAAAACA 
134251 GCACAAAATT GTAAATATTG GAAAAGGACC AC AT C AGTGT AGTTACTAGC 

1343 01 AGTAAGACAG ACAGGATGAA AAATAGTTTT GTAAACAGAA GTATCTAACT 

1343 51 ACTTTACTCT GTTCATACAC TACGTAAAAC CTACTAAGTA ATAAAACTAG 

Ovalbumin exon 3 

1344 01 AATAACAACA TCTTTCTTTC TCTTTGTATT CAGTGTGGCA CATCTGTAAA 
134451 CGTTCACTCT TCACTTAGAG ACATCCTCAA CCAAATCACC AAACCAAATG 
134501 ATGTTTATTC GTTCAGCCTT GCCAGTAGAC TTTATGCTGA AGAGAGATAC 
134551 CCAATCCTGC CAGTAAGTTG CTCTAAAATC TGATCTGAGT GTATTT C CAT 
134 6 01 GCCAAAGCTC TACCATTCTG TAATGCAAAA ACAGTCAGAG TTCCACATGT 
134 651 TTCACTAAGA AAATTTCTTT TTCTCTTGTT TTTACAAATG AAAGAGAGGA 
134 701 CAAATAACAT TTCTCTATCA CCGACCTGAA ACTCTACAGT CTTCAGAGAA 
134 751 TGAATGGCTT GCTAAAAGAA TGTCAAATCT T AC T ATAC AG CTATTT CAT A 
134 8 01 TTACACTACT AAATACACTA T AAGG CAT AG CATGTAGTAA TACAGTGTAA 
134 851 AATAGCTTTT TACACTACTA TATTATTAAT ATCTGTTAAT TCCAGTCTTG 
1349 01 CATTTCACAT TTGCAAAACG TTTTGAAATT CGTATCTGAA AGCTGAATAC 

Ovalbumin exon 4 
134951 TCTTGCTTTA CAGGAATACT TGCAGTGTGT GAAGGAACTG TATAGAGGAG 
13 5 0 01 GCTTGGAACC TATCAACTTT CAAACAGCTG CAGATCAAGC CAGAGAGCTC 
135051 ATCAATTCCT GGGTAGAAAG TCAGACAAAT GGTAAGGTAG AACATGCTTT 
135101 GTACATAGTG AGAGTTGGTT CACCCTAATA CTGAGAACCT GGATATAGCT 
13 5151 C AG C CAGCGT GCTTTGCGTT CAAGCTTACC AGAGCTGTTG TATGCCTGTT 
135201 AAGCAGGGCA TACAGT CATG AGGCTCTTGA AAAATCTTAA CAGACAAAGG 
13 52 51 GCAATGGAAA ATCGGAGTTA AGGGATGGTA GGGATAAAAT GCATAGAAAG 
1353 01 AGGTACCACA ATTTTGATTT TTGCCCTAAT GCCTCTCTGC GTGGTTCCTC 
13 53 51 AATTTTT CTA CTTCATTCCT CATCTCCTCA GAGCATTCCT TTCCCTCATG 
13 54 01 CTTGAAACAC AGATGAAAGA CTGTGAATTC TAACTGAGAT GAAAACATCC 
135451 ACAACCACAC AACCTCTGGT GTGGAGT CAC ATTCTGTGAA GGCAAAAACT 
13 5501 AGGCCACGTA ATCTATGTGT GCAAGCTACG TGTAAGCTAT GTGTGTGACA 
135551 GGACAATGTG AGGAACATAC TATGTGCACA AGGACTGCAG AATAAACAGG 
13 56 01 AGCAAAGTTT TTGAAGAAAA CAGAGTAAAA TCCTGTTTTC CTCTTTTGTT 
13 5651 ACATTCTTTA CATATATCTC AAATTTCCTC TTTGGTTAGA AGCAAGTAAT 
13 57 01 ATTTATGTTT CTTGGTACTG TTTGGGTTGA AGACCATTCT GGGATAAGAG 
13 5751 AAATTCCAGT GGTTCTTCCC CTAAT CAT AA AATGTACAGG TTTAGTTTTT 
13 58 01 TTGTAACACA GAAATCTCTT CATCTTTTAT CTTTTGTTGT GATTCTTTAT 
135851 AGAGAGAGAA ACAAGACTTA CTGACAATAG C AG C AAG AAA AT CAATCTTG 
135901 GAAGAACAAG ATTGCAGTTG CAAAAACAAA CCAATGTCCT TGCCCCTACA 
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135 951 TCCTCTTCCC CATAAATTCT ACATTCTCTA TCTACCTTGT GCTTGCCAAC 

136 001 ATGATATACG TAAACTCTCT TTTCGTATTC ATTCTTAAAG GAATTATCAG 

Ovalbumin exon 5 
13 6 0 51 AAATGTCCTT CAGCCAAGCT CCGTGGATTC TCAAACTGCA ATGGTTCTGG 
13 6101 TTAATGCCAT TGTCTTCAAA GGACTGTGGG AGAAAGCATT TAAGGAT GAA 
136151 GACACACAAG CAATGCCTTT CAGAGT GACT GAGGTATATG GGCATACCTT 
13 62 01 AGAGATGTAA TCTAGAATTT ATGAAGAGAG TAGACATGTT GTTATATGAA 
13 62 51 CACTGCATTA GCGTATCTGC TCATTTGTCT GCATCTCTTT CAGACACTGT 
13 6301 GTTAAAAGCA GGGAATTTTC CTTATGTCTC TCTCATCACA ATATTCCTGA 
13 63 51 CATTGCAAAG CTCCTGAGAA ATAACTTCAG ATTCCCACTT TTCCTAGGAA 
13 6401 GGTCTTCCTG GATGAGAACA AT C AAT CAT C TTAACTGTAA CTAGATATTT 
13 6451 CTGCATCTAA GAATAAT CTT TGTTAAAACT ATATTCTCTC TCTCTTTTTT 

Ovalbumin exon 6 
13 65 01 TTTTTTTTTT GGTTCTCCAG CAAGAAAGCA AACCTGTGCA GATGATGTAC 
13 6551 CAGATTGGTT TATTTAGAGT GGCATCAATG GCTTCTGAGA AAATGAAGAT 
13 66 01 CCTGGAGCTT CCATTTGCCA GTGGGACAAT GAGCATGTTG GTGCTGTTGC 

136 651 CTGATGAAGT CTCAGGCCTT GAGCAGGTAT GGCCCTAGAA GTTGGCTTCA 
13 67 01 GAATATTAAA AACACATGGA AATTTAGCTG TTGTAAAGCT CTTTTCAACA 
136751 CAGTTATCCT AAAACATTTA ACCAGCACAA ATTTCATCAT GATTCAATAT 
13 68 01 GTGATTGTTG CATAGAAGTG TAGATTTGTC CCACTGGGTC CTGCAATAGC 
13 6 8 51 CCATGCTGAG CATGGCTTGC TGAAAGAACT GCTTTAGAGG GTGAAAAGTT 
13 69 01 TGACACAGCA GACAAGATGA TTCTCACCTA AGCAGCTGTT ACTGTAGTGG 
13 69 51 CTTGAACTCT AAAGGTCTTG TATCTCCATT CCTGTGCACT GAGGAG CTT C 
13 70 01 TTGGAAAGTT CATATAAGGT TTACTAGTTC TAACTATTAT CTCATTTGGT 

137 051 GGCACTCAAT GTGCTTTGTT CACGTCTTCA TAAATTAATC TATCTAAAAA 
13 7101 TTGGATGTGG TTAAAGCAAT TTCAGAAATA ACATGTACAT AATGTACAAT 
13 7151 TATTGATATG AACAGAACAC AGGCATAGCA TATTGTAATT AGGAGGACTG 
13 72 01 TAGTTATTTT GAATAGGAAA CACAATGTAA TAAATGAGAA TTCATTGAAA 
1372 51 TGTTAGTATG CTAACT CAAT CTAAATTATA AAGATAAAGA GGCATTTAAT 
13 7301 CACAGCTAGA TTTCCATCAC TTGTGACAGA CAGGCATATG AATGATTATG 
13 73 51 TACAGCTCTA GGAAAAAAAG TATGTAGGAA AACTAGTACA TTTTGATTAG 
13 7401 AAAGTCTGAA AATGAGGTGC CTTGATCAAA GAGAATACGT GTGTTTGAGA 
13 7451 AAAAAAAAGT TTGGATAGAG GTGGTAAGAG AGAATATATT GAAATGGTGT 
13 75 01 TTCTACAAAC TGCCATGGCC AGATTTGTGT AAGAGACATT CAGTAAGTAG 
13 7551 GCAAGGAAAG AAATATTACT AGGTACAAAG CAACATTAGT AATACCAAAA 
1376 01 GAAACCAATT ATTCCAGATG CCAATCTCGT AATAGGGTTA AGAGATTTCC 
13 76 51 ACCCCTCTAG TAGTCACCAG TGCAACCAGT AACTTTGCTA ATTTACATTT 
13 77 01 TCTTTTTTTA AATGGCAGAT ATAGCTTTGA ACTGAGTGAT CATGAACTGG 
13 7 751 TACTGTGTAA ATAAGATGGA AGCATACTTG GGAGCTAAAC TTCTAGTTTT. 
13 7 8 01 TAAAAACTCA AATTCTCTTG AAAGATCAGT TCCCAGTCTA GTAACAGCTG 
13 78 51 ATAGTTTAAG TATCAGTAAT TGGCTACCAT TAACAACTGG CTCCTGAGAG 
13 79 01 GTCTTAAATG T AG AGAC AG C TTTAAACTCA AAAGCACAGA GTGATTTTTA 
1379 51 GAATAGATTT CCCAAGCAAA GAAAATAAAC AGGGAGGAGC TTTAAGGGAG 
13 8 0 01 TAGCCATCTC ATTATTATTA TTATTTAAAG AAATGGC AG C AAGGCTATAA 
13 8 0 51 AAGAAAAATA AGACAGAGCA GAGAAGAAAG AGTCATGGTA TGCTTTTCTA 
13 8101 TCTTAGCAAA ATTAAT CTCT ACATGCCTAG G AAAAAGC C A TGACAAGAGC 
13 8151 AATCAGTTCA AAAGGTGTAT GCAAAAAAAC ACATAATAGT AACTAGTACT 
13 82 01 GCATTGCCAG GAAGGAAGTT ATGT CG C CAT TCCATGGATC T C ATT CT CAT 

Ovalbumin exon 7 
1382 51 TTCCTTGCAG CTTGAGAGTA TAATCAACTT TGAAAAACTG ACTGAATGGA 
13 83 01 CCAGTTCTAA TGTTATGGAA GAGAGGAAGA TCAAAGTGTA CTTACCTCGC 
13 83 51 AT GAAGATGG AGGAAAAATA CAACCTCACA TCTGTCTTAA TGGCTATGGG 
13 84 01 CATTACTGAC GTGTTTAGCT CTTCAGCCAA TCTGTCTGGC ATCTCCTCAG 
13 84 51 CAGAGAGCCT GAAGATATCT CAAGCTGTCC ATGCAGCACA TGCAGAAATC 
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13 85 01 AATGAAGCAG GCAGAGAGGT 
13 8551 TGCAAGCGTC TCTGAAGAAT 
13 8 601 T CAAGC AC AT CGCAACCAAC 
13 8651 CCT TAAA AAG AAGAAAGCTG 
13 8701 AG C ACTGT AG TAT C AGGGGT 
13 8 751 AGACTTCATA AAAGCTGGAG 
13 88 01 TACACTGTGA GAACAGGTGC 
13 88 51 TGGTAACTCA TGGATGAAGG 
13 89 01 TGAGTCATCA CACTGAAAAA 
13 8951 GGGGGAAAAA TGCAGCCTTC 
1390 01 TGCTCCAGAA TTAGTCACTC 
13 9 051 ACCAACCATT CCTATGCTGA 
139101 ATACTACAAG GCTCTTCCTG 
13 9151 TAATTCACAT TTCTCCCTAA 
13 9201 TATGGTATAT TACTATTCAA 

1392 51 TCTTGTGAAT GTGCTCTTTT 
139301 AGCAAACACT TTTCACTTGT 

1393 51 AGGGAAAGAA TGACATGCAG 
13 94 01 CGACTGTAGA ACAAGTACTA 
139451 CAGCAACTTA ATCTAATAAG 
13 9 501 GTGAGGGATG AAATTGGTTC 
13 9551 TCTGGAGCAG TGTGTCCAAT 
1396 01 TGGGACAGAC CTTGGCTGCC 
13 9651 TCAACTCAGA TTTACCCAAG 
13 97 01 AAACTTATGG CTAGTGATGA 
13 9751 CAAGGCTGGC ATGCTGGAAT 
13 98 01 ACAGCACCAC TGACATGGGC 
13 98 51 TGCCTCAAAG CCACTTCTCT 
139901 CAGGAGCTGG GTTTGTTCAA 

13 9951 GTACCTTCAC AGGGACAGAG 

14 0 0 01 CTGACTTTCA AATGTGCCTA 
14 0 0 51 AAGAAGGGGT TCAGAAGCCA 
140101 GTTGAAGACA TTTGTGCTGA 
14 0151 GGATTGTGGC TAACCCACTA 
14 02 01 AGGAAGAGCA AAAGAACATG 
14 02 51 CTACTAAAAG GTGCTGTGAA 
14 03 01 TGAAAGAGGC CGCAGCACAC 
14 03 51 GGGCAGGAGG GCTGGGGGAA 
14 04 01 TGCACTCCTG AGGGGTGGAC 
14 0451 CCTGTGATGA ACTGTCACAC 
14 05 01 GCAGTGCGCT TACTGCAGGG 
140551 CAGCTTGGCA CCAATATTCA 
140601 ACCACACAGC ATTTCGCTTG 
14 0651 AGATAACACA TACTTTGCCA 
14 07 01 AGCAAACAGG ATCTTGCAAA 
14 07 51 TGGTGTATAA ATAGTT C AAA 
14 08 01 GATTGCAGGT CAGGAGATAA 
14 0 851 TGTTTAATAT TGTCTTGGAC 
140901 ACTCAAAATC CTCAGGACGT 
14 0951 CACTATTTCT AGGCCATTGG 
1410 01 CTCTGCTATC TCACTCCCTT 
141051 CAGGAGAAGA CTTAGACTCA 
141101 ATTGCTCTGT CACCACACCC 
141151 AAGAAG CACA GTACATTAGT 
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GGTAGGGTCA GCAGAGGCTG GAGTGGATGC 
TTAGGGCTGA CCATCCATTC CTCTTCTGTA 
GCCGTTCTCT TCTTTGGCAG ATGTGTTTCC 

AAAAACTCTG TCCCTTCCAA CAAGACCCAG 
AAAATGAAAA GTATGTTATC TGCTGCATCC 
CTTAATCTAG AAAAAAAATC AGAAAGAAAT 
AATTCACTTT TCCTTTACAC AGAGTAATAC 
CTTAAGGGAA TGAAATTGGA CTCACAGTAC 
TGCAACCTGA TACATCAGCA GAAGGTTTAT 
CAATTAAGCC AGATAT CTGT ATGAC CAAGC 
AAAATCTCTC AGATTAAATT ATCAACTGTC 
CAAGGCAATT GCTTGTTCTC TGTGTTCCTG 
ACTT C CT AAA GATGCATTAT AAAAATCTTA 
ACTTTGACTC AAT CATGGT A TGTTGGCAAA 
ATTGTTTTCC TTGTACCCAT ATGTAATGGG 
GTTCCTTTAA T CAT AAT AAA AACATGTTTA 
AGTATTTGAA GTACAGCAAG GTTGTGTAGC 
AGGAATAAGT ATGGACACAC AGG CTAGCAG 
ATGGGTGAGA AGTTGAACAA GAGTCCCCTA 
CTAGTGGTCT ACATCAGCTA AAAGAGCATA 
TCCTTTCTAA GCATCACCTG GGACAACTCA 
CTGCCGCTGC CCTGATCCTG GCTGGGGTGA 
ACTGAGACAT CTGAGACACT GAG AT CTGT C 
AACAGATCAT TGCCAACAGA ACAAAATCTC 
CAGCAGTCAG TTGTCCCATC TGTGACCCAC 
GAGCAGGCTT TGGTGGCTTG TAGTTACTGG 
AGGGGAAAAA CTGAGCATGG TGTAAATCAC 
GTGCCTGCAC CATGCTTGAA AGCTCTTCTA 
GAAAGCTTCT GTTTCTCCCA TCTGCTTCTT 
TTAGAAGGGT ACAGCCATGG CTGGAAGGGG 
ATTTTCCTTT GGTTGCTGCT GCAGCTGCAG 
AGAGCTTTGA GATAAGGATG CCTAACCTAT 
CACCTCAGGC CCCAGGATAG GACAACTGCT 
GCTACAGAAC CTAATTTATA TTACCAGATT 
TATTTATAAC AGGAGGTCTT CTGTGCTTCT 
GGAGCCCACA GTGCAGCAGT GTATGAGGCC 
GAAGAGC C CT GGTAGGAGCA GCACACAGAG 
CTGCCACCCA TGGGGAC CTG TGTGAAGCAG 
TGCGTGGGAA AGGAAAAGAA AGCAAACAGA 
AGACTGCAGA GTGACAGAGG AGGGCTTGAG 
AGTGGCGCTC CTTCCTCACA GCAGCGCTAA 
GTAGTCTGTG GTGATGCTTT TTCCAGTTTC 
TTCTACTTGT TTTAGCTTTC CCCCTCCACA 
GTCAGTCCCT AAGACCTTAG CCTAACAGTT 
AGAAGGAAGA TAACATGACA CCACCTTCAC 
TACTTTCCTT CACTTTCCCG TAAATTAGTT 
CAGGGGAACT TACTGCAAGA GAGAAAATGA 
TTTCTGGTGG TCTGGGCATG AAAATGGAGT 
TTATTTTTCA CCTGATTTAT TCCCAAACTG 
AGTTCTTATC AATTAAATTA TACTTTGGCT 
TCATCTTCAG CATCACTTTC AG CACAATT A 
GAGCTTTAGG ACTCATCATA AGAGGCTTTC 
CAT AT AG AT C TGTAGTATAC CACACATGTG 
GCATTACAGA GAGACAAAAC CACACCTATT 
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141201 
141251 
141301 
141351 
141401 
141451 
141501 
141551 
141601 
141651 
141701 
141751 
141801 
141851 
141901 
141951 
142001 
142051 
142101 
142151 
142201 
142251 
142301 
142351 
142401 
142451 
142501 
142551 
142601 
142651 
142701 
142751 
142801 
142851 
142901 
142951 
143001 
143051 
143101 
143151 
143201 
143251 
143301 
143351 
143401 
143451 
143501 
143551 
143601 
143651 
143701 
143751 
143801 
143851 



TGTGTGCCTG 
TCTATAAAAA 
CCTGATTCTT 
GTCAGCAATT 
TTTCCAGTAC 
ACCTTGCTCA 
CCTTCCCTAG 
TTCTGATTAA 
GTAGTGAAAG 
AGCTGCAGGT 
CCACAAAGCT 
TCTACAGATT 
CAACAAATTC 
GTAGCTGCAC 
ACTCCATGTT 
CTTCCTTTTC 
GTT CTGTATA 
ATGGTACTCC 
ATATTTTTTT 
CAATAGCAAC 
ACCATACTGA 
CCTTAGTCAC 
GTATGGGAAA 
AGGAAGGCAG 
T AC CAT CAAC 
GATTCTCCGC 
ACGTCCCTTC 
TTTTGCTCTG 
AAGAGGG C AG 
GGAGAGAAAG 
AGTACTTAGT 
GAGGGTGATC 
CCTTGCATAC 
GCAGAGTCCA 
GAGAAACAAT 
AGCTATACCC 
AAGAGT CTAG 
ATCTGGTGGT 
CCAGCCCAAA 
AACTGGAAAG 
GCATGTGATC 
AGTTCACCTT 
AACAAAATCT 
TCATCTTGTT 
CTACTATGCA 
TCAAGAGACT 
GATGGAAAAA 
ATACTAACAA 
GGAAAATTCT 
AAGACTTGCT 
TCATGCTAAA 
TACTGAACAG 
TTTCACTGCC 
TATTCCACCG 



CAGTCTTACA 
TGAGAGAATA 
CAAACGTAAG 
TGCAAGAACT 
TTCTGGCTGG 
AAGCAGGACT 
TCAAGTGTTG 
TTTGATTTTA 
CCATTCAACT 
GAAAATAAGC 
CTCACTGTCT 
AT CT CTGTAA 
CTACATTTAA 
ACTACACATT 
GCTGCCATGT 
TAACTAGTAT 
GTCTAGCTGA 
TGAGGAAAAA 
TTAACCTACA 
CCACAGTCTG 
TGACAGGAAT 
ATAGGTAAAA 
GGGAAAATAA 
ATATATGGGA 
TTTCACAGGG 
TGCCTCATTT 
CTCAAACTGT 
TCCTTCAGCA 
TTCTGGAGTC 
GTTTGGGAGT 
CCAAGCTGCT 
AATGCCATTC 
AGTATACTAG 
AGGTAAAGAG 
TAGCAACTGC 
AGGGGTCTGG 
AGCAGTTTAA 
AAGCTTTTAA 
ATATAATGGA 
AACAGCTTCA 
ATTAGATTTC 
TCTGTGGGAC 
TTGCATCTCA 
TTACTTGGTG 
GAACTTGCTA 
AAAATGCTTC 
TCTTGCAGTG 
ATGGCACTCT 
ATTCCAAGCT 
G CAT AC AC AA 
AAGTGTGAAA 
CAAAGATTGC 
GAAACTT CTG 
CACAGGAGTA 
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CCAGCAGGAA 
TGGCCCCTGG 
AATGCAAGTA 
GATATTCTGA 
AACGGGCAGC 
AT CTT CAAAG 
CCTATCTGCA 
AACATAAATT 
GGCTTTACTT 
CCTGCCAGTT 
GTCCTCACTT 
ATCCCAAGCA 
CAATATTTAA 
TTTAGACCCA 
GGATAACAAT 
ACTCTTAAAC 
CTTGTGACAA 
AAGGATCCCA 
GAGCACCTTG 
AAATCAATGC 
AGTGCAAGTC 
TTTAGAAATT 
TGGGGTGCCA 
ATTAAAGGTG 
CTGTATGTAA 
CTTCTGGGCA 
TACCTAATCC 
GTCTCTACCT 
AAGCTCTGTT 
GTGAGGAGAG 
TTCAGCTGCA 
CTCCAGCCAC 
CTTTCCTTAG 
GCTTTGT CTA 
AAAATCAAGA 
CAAATGAAGG 
GGGAATAATG 
CTTTGAAAGA 
GCCATAAAGG 
AAGAGCTTGG 
AAAAGAAGGT 
AAAAGATGCG 
TTTTGCCTGA 
TGTATCACAT 
ACTTGAACCA 
TTGCAACAGG 
ATGAAGGCAC 
ATCTTTCCCA 
CTCTTTGAGG 
GCGCACTTAG 
ATAGAAATAC 
CAGTACGTGT 
AAAAT AT C AA 
TTTGTGGTTG 



GATAATTAAC 
GTCCTACTGC 
AAATTACTCA 
AGTT CAAGTA 
TGAAAAT C AC 
CCATATCAGA 
TGGTTGGAGA 
CAAATGT C AC 
TCTCTTACCA 
CTCATTTTTT 
GTAATACTTT 
GTACCTAGTC 
GAGCAAAGGC 
ATAGTATAAT 
GCGCAATCAT 
GTCACAAGAT 
GAGCAAACAC 
AACTAATTTT 
CTACTTCTGC 
AATGAACTTC 
CTTACACTGG 
GCAGCTCTGA 
GATGAGTGCA 
GACAAGGGAT 
AAGCAGCTCT 
AAGTTTGTTA 
CACCCTCATT 
GCTTCTTAAG 
TCTATGAGGG 
CCTTTTTCCT 
TCTGCAGAAG 
AGAGCAAGGG 
TCAAATGTTT 
CAGCTAGGTC 
GGTACTAAGA 
GGGACAGATC 
CCACTAGTTT 
G AC AGAAAT C 
TCTGCACGTA 
AAGTGCTGAG 
CCTCAGCACA 
TCCCTCACAA 
GAGGAGAAGG 
CATTAATTTC 
TGTAAAAAGC 
CAGAGTGTGA 
TGATAAGAGA 
AGATCTTTGT 
GGTTACCATG 
T C AGGT C ACT 
AAATAAGGGG 
C CACAATG AG 
CTGCCTTATG 
AGCTGCATGA 



GTAATGAATT 

TTGTTCTAGT 

CTTGAACAAA 

ATTAGAGTGA 

CTGGTCCAGC 

TAGCTCCAGA 

ACCCACAGCC 

TAGCGTAGCA 

AATGAGAGTT 

CTCCCACAGC 

TGAAC CAACA 

ACCACGTGAA 

C AG AC CAT AT 

TTATACTTTG 

TTGTACCTGG 

AAAGACTCTA 

TCACAATTTC 

GAGCTTTTAC 

TGAATGTTAG 

TACTATGGGT 

AAGGCTGACT 

TAAGAGATCA 

CCTTCCTGAA 

GCTGTGGAGG 

CTTTCCTGTT 

CTCTCCAGTA 

GCCTTCTCTG 

GTAGTGAAGT 

TAAAGGCCAG 

GTGTTGTTCA 

ATGGGGAATG 

CTTTGCCTCT 

CCTCTGTGCT 

TATGTTCCTA 

AAGCCTCTGA 

AAGAAGAAAG 

TAAGCCACAC 

T C AAG AT AC A 

GCTGAATCCC 

GTGAAGAAGA 

ATAAC CAGAA 

AGGCTGGGGG 

AAATACAAGA 

TATTTGGTCA 

ACACTAGGTC 

GAGAT GGAAG 

TGTTGAAATG 

CAGCATGAAG 

TT CCAGGAT A 

CAGATCAGTC 

C C AAG C AG AT 

TATTTGGACA 

AAACTCTGGT 

AGAAATAGCA 
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143901 
143951 
144001 
144051 
144101 
144151 
144201 
144251 
144301 
144351 
144401 
144451 
144501 
144551 
144601 

144651 
144701 
144751 
144801 
144851 
144901 
144951 
145001 
145051 
145101 
145151 
145201 
145251 
145301 
145351 
145401 
145451 
145501 
145551 
145601 
145651 
145701 
145751 
145801 
145851 
145901 
145951 
146001 
146051 
146101 
146151 
146201 
146251 
146301 
146351 
146401 
146451 
146501 



AGTGTTTAAA 
TTGCACATCC 
TTCAGCAGTT 
GAAAAATGAC 
TACTCACAAT 
TATTTTTGGT 
TGATTCATGC 
ATACTCACTT 
TTCTCCAGAA 
CACTGTCTTT 
GTGCAGTATG 
CACGCCTTTG 
GACAGGAACA 
TGGAGTATGT 
ATATGTACGC 

GTGTTCTCAC 
CTCTTTTGAA 
TCCCACATAA 
AGACATATAC 

TTGCTGCATT 
CCCACTCCAA 
CGTGCTCCCA 
TATGAAACTA 
GTTTCAAAAC 
AATGGTTTGG 
TCTGGCTGAG 
ACCATTCCCC 
ATTAGAAGAT 
TTCTTTTCAA 
GAATTACAAA 
CATGGCAAGT 
TCCTCTCAAA 
TAG AC AG CAA 
GCAACATACT 
AACTGAAGGA 
CCTGGTGTGC 
ACAGTGAATG 
CAAACATAAG 
AGTGAGGGAA 
CTGTACCTCC 
AGT CATC AC A 
TGTATGTATT 
TTTTTAAAAG 
TTCAAATCCC 
ACAGAGCAAG 
AAAACAGAAA 
ATCTATCATT 
CATTTAAAGC 
TTCCCAGGCG 
TACTATACTC 
AAACATGCTG 
AAAGAAC CAT 
CTTCAGCAAA 



CTGATTTCTT 
ATGCGTGGCT 
GTGTAGCTGA 
ATGTTTAGGC 
GAAGTACTAT 
TTTAATTGCT 
AAAGGAAAAC 
C C AGATAAAC 
GTGTTGCACT 
TCTAATTACG 
ACAAATTATG 
CGTTCTGCAG 
CCACCACACC 
CAGCAAAGAT 
TTTAGATAGC 

TCAAAGCTTT 
CTTTAAAAGC 
AAATCTTAGG 
ACCATTGTTA 

CTGTGACAAA 
GCACCTGATC 
TTCCCCATAC 
AATGTTAAGC 
CATTGTCTTC 
ACATTTAAAG 
ACAAAAACAC 
CCCAGATAAA 
GCCAAAGAAA 
ATACCAAACA 
GGGTTCGTAG 
TT CAC AC AAA 
ACCCTCAAGT 
GTATTTAACC 
GTGTAGGAAA 
AAACAAAATG 
TGCCAATATT 
AACTACAGGC 
ACTTTGAGCA 
CCGCTGCTCC 
TAG C C TT GGT 
TTGATTTATT 
TTTTTTTCCA 
GCAACGTTAA 
ACATTTCTGC 
TACTATGACT 
AT CC AAGTAA 
TGTTCTACAA 
TTTGGAAATC 
CAATGGAAGC 
TGGAAAACGT 
TCTTTATGTA 
CCTCCTAAAG 
GCTTTGAGTT 
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AAAAGAGAGC 
CCTCTTCAGG 
CGTAGTTATA 
T AAAGTG CAT 
GCAGCATGAA 
ACCTTGGTTA 
TGCACACTGG 
TAATCTCTAC 
GGT CAT C AGA 
G CAT CTAAGC 
CCAGTGTTCA 
GTGTGGAGCA 
TCTTAACAAC 
TGCTTGGCAA 
TATCTACACT 

MAR 
AAAGGGAAAT 
TATTTGAGAC 
AC AT AAAAT C 
CGCTTTAATT 
AGGGCTCCTT 
ACTGATC AC C 
TGTTTTGGCT 
TGCCCTCCCA 
CCCACCCCTC 
TCCTTTCATG 
GAGCAGAATG 
TAAAC AG CAC 
TGGAGTAGCT 
CCTTAGGTTT 
TTAAAAC AG C 
TACTGTGTGA 
ACATTTAGAT 
TACTCCTGAT 
GTTGCACTGA 
TGCTTATATA 
TGTCCTGCCA 
TTACTTCCCA 
TGTTGGAGTG 
TGAGTCAGCC 
AACTTCACAG 
GAAT AT CTTC 
CTAATAACCA 
CACCCTGTAC 
ATATATGGAA 
AGTCAGCAAA 
CTGCTCCAAG 
CACAGTTAAT 
ATTTAAGGCC 
AATTAATTCT 
AACAGATGCA 
GCTATGAATT 
CTATGTGGCT 
CCCAGGTTAC 



CTTTCCTCTA 
AGCAGGAATT 
CCCTTTGAGA 
GTAATC CACA 
ATTCAGGCTA 
CTTAAAAAAT 
TAGATGTGAG 
TCAGATATCG 
ACTGAGTATC 
TAAAGCACAC 
ATT C ATGTGC 
AAATGCCTCA 
TCATAAATTC 
AGGTTGCAAA 
GTTTCAAAAT 

AAGAT ACT C A 
TTCACGATGA 
CATTACAACC 
TACAAGGTCA 

TGCACACCAA 
ATTACCATCA 
CTTGCCAATT 
CAGGATTCCA 
TTATCTCCTG 
TTTAAAACTT 
CTCTGTTGGC 
TTTTACTGGT 
TTT CTTCAAG 
GAATTACATT 
ATACGTACAT 
AAGCAGAAGC 
CACTTTATAA 
CCCAGGTACC 
CTTGTGCTAA 
GCTGAGATCT 
AAATGGAAAC 
CAGGAAGGAT 
TTGAQTTAGT 
TCAGCACCGC 
GATGCTGGAA 
CTTTTAGATT 
GCCATGCTAT 
AAAC AC CAT C 
CATGTCAGTC 
TT AAATT C AT 
GGATGAAGTT 
TTTG CAAAGA 
AATGTAAACA 
GCAGCACACC 
T CTAATT ATA 
CAAAACAGCT 
GCTCACCTGT 
ACTGATGCAG 



CATGCTGCTC 
GGTTTTCTGA 
GATTTCTTCA 
CATACACCAT 
TTCTTCTTCA 
GCTCACCATC 
AAC AG CACGC 
AGATCATTGC 
TCAGGAAAAG 
AGCGGTAATA 
CAAATCTCAC 
GTGATATTTA 
TAAATGCTAT 
TGTACATGTA 
AAAGACGCGT 

AAGAAATAAT 
TACAAACTTA 
ATTCCAGCTG 
GGACAAGCTC 

AATCCATGCA 
CTTCAGTCTC 
ACAGGATTGT 
ACATTCTCAG 
AAGTCCTTAT 
ACTGGCCTGC 
TGAACCAGAA 
AAAAAAAGAT 
CATAATTTTT 
AGATTTT CAA 
GAAAAC CAGC 
TACCAAACCT 
ATGAT CTATG 
AATGAACTGA 
GTTGCACGGA 
GGCCAGGGTG 
ATGAATGACC 
ACTACCAATA 
AGAGAGTGGG 
CCATTGAACT 
AATATTATCA 
TTAGTTGCTT 
TCAAAAGGCA 
CTCTCATTCA 
ACTTCTTGTA 
CCCTGCTTTA 
TATTTAGTGT 
TGACTCAAAT 
GATTACAACT 
TCTCCTACAC 
ACCCACACTG 
GAGGGGCAGG 
AGGAAAGCAA 
AAGCTTCACC 
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14 6551 ACTGGCAAGG TGCTCCTTGT 
14 66 01 TTAGCAGATA AT C C CAT AT C 
146651 GCCACTTCTG TACGAAATAA 
14 67 01 AGAAGAAAAG TTTCAGAAAG 
146 751 GTGAGTCTTT CTCCATATTG 
14 68 01 TATCCAGGTG TTCAAGGTGA 
14 6 851 GAAATAAAGG GCAGAGTTAC 
14 69 01 AGATGGACAG GACAATGAGG 
14 6 951 CGATGGAAAT GCTTCACTAG 
14 7 0 01 GAATATCAAT TTCTGGGGAA 
14 7051 AAATAGCTGC TTTCAACTTA 
14 7101 AAAAAAAAAA GTCTAATCTG 
14 7151 GTGATAAGAA AAGTGT CTG A 
14 72 01 GTGAAAACAT TTTCCTGGTT 
14 7251 AAAAAT C ATT TTACTAAAAA 
14 73 01 ATCTGTAGGA AACGGACAAT 

1473 51 CTTTCGCTGT CTCTTCCTTG 

1474 01 AGGTCAGCTA CTGCATTTTT 
14 74 51 TTCCAGCAGT GATGGTTCAA 
14 75 01 TTATAGAAAT ATTTCTGCAG 
14 7 551 CTCTCACCGT CACTTCACAT 
14 76 01 GGGATGTGGC CTTCAGGGCA 
14 76 51 ACCTTTGAAG AACAGGC CAG 

14 77 01 TGAACTGCAT AAACCTTGGA 
14 77 51 ACAAAACAAG GTACTGTAAT 
14 78 01 CTTGGATGAT GTCGATCTGT 
14 7 8 51 GAATCTGAAT AGAAAACAAT 
14 79 01 GCACAGGCAA GGT CTAGAAG 
14 7951 GGTTTATTCT ATGTAGTTCT 
14 8 0 01 AAGAAC C ATT GTGTTTGCTT 
14 8 0 51 ATTCTTGTGT ACGTCAAGAC 
14 8101 TCTGTATCAA CCAGAATAGC 
14 8151 AACTGCAATG CAAATAACAA 
14 82 01 AATTTGTCTT GCAGAACTGG 
14 8251 C AGAGTTT C C AGAAATTTTT 
14 83 01 GCACACAGTA GTTAGAGAGC 
14 83 51 ATCCTAAAGG AATGGAATAT 
14 84 01 CTCTTGCACC AGTAGT CAT A 
14 8451 CGCACCAGTG AGTAGCATTA 
14 85 01 CAGTGCAATA AACGTCCTGA 
14 8551 CTTTTTCTAT ATCACTTGAG 
14 8 601 CACTATACTT AATATAACTT 
14 8651 AAGATATCTC TCTTCTTTCC 
14 8 701 TGCTTACATA GAATTCACCA 
14 8751 TAGTATAACA CTGTAACCAT 
14 88 01 CTGGCTGTCC CTTGCATGGA 
14 8851 ACCAAGGAGA GTTCATTCAT 
14 89 01 GAATATTGTA C CAGTACTGA 
14 8951 AGGTCAGAGG AGAG CTGAAG 
149 0 01 TTGAACATAC ACTGCCAGTG 
14 9051 GCACTGGCAT TCTTCCACAG 
149101 CCTACAAATC ATAAAATAAA 
149151 TATAGCAAAC ACAATTAATT 



GTGAGCAACC TATTCTGCTC TATAAAACAT 
TCAGTCCTCA GTACAAGAGA CTCTGTGCCA 
GCCCACACGT ACTTTCATAG ACCTCAGGGC 
CAGTTTGTGC TGAGGAGATA GACCTTGGGG 
AGGCGGAATC CCTCAAAGAC AAGCAGCCCT 
TATTTTCAAC AGAGCAGGGA GGTAAAGAAT 
ATAGGATTTT TCAGTCAGAG GTGAGAGCTG 
TAAGGACAGT GTGACTGTGA GGAGAATTAG 
CCAAGGCAAG AAGAAAAAGA GTATTCAATA 
AGAATTCATC TCTGAAGGGC TACATAGGGC 
GAACAGGGAA ACTGAGGCAG CAGCAAAAAA 
AAACCCACAT CAGGTTCTAC TGTTGTTGCA 
TGAGTGTTTC TCAACCTTCG TTATCT CAC A 
ATACAGTTTT AGAATCCTCC AATATTACCA 
TGGAATCCCA CAAGAAATGA CTAATTTTTT 
AGAAAAACTC ATAAATATGA TGTCACTGTC 
GAAATTGTTT CTATTAGAGG AATCATAAGT 
TTACCCTCCA AATTGCAAAA GAAATGTTGT 
GTTGTAACTA GCCTGTTGCC ACAAAAATGT 
TCAGTTTTGT AAGGTT CTTG TATGGTATCA 
CCTACTCTGA GATGATTCAG TTCTTCCATA 
AAATAAATTG CAGAGTCATG AGT CATAAGA 
GCAGA CTATA AATTCAGCCT ACCCATCATC 

W gene exon L 
CAGACCCAGA GCAGCAGTCT TCCTCCCTGC 
TATTTCTAGA GATTATTTAT TTCACCTACT 
TGACAAATGC ATAGAAAAAA AAGGCAGAAG 
AAATACTTGA GGAAGAAATT TACTAAGATG 
GGGTAAGTCT CAGAAATATG CAGGAAGAGT 
CACTGGCAAA C ATGTAT AT C ATAGCAGAGT 
AAGTTAGATC ATTTGTTCAT GTGCTCTTCA 
ACAGT CAGTA CATCTTTATT TTATGGCTAT 
TCCCACTACA TACCTAGGGC TCTCAGCTTC 
AGAGCAGCAC CTGTGTTCTA CCAATAAGGA 
GAAGCTATGA TTCGGACTAG CACCATAAGA 
GAAAGTTAAA AATGGAATTC AGGATACTTA 
TGCTTGTCTA GTGCTTCAAT ATTTTCTTGT 
TTGTTCACTA CAGTTACGAG CTCCAAAAGG 
AGAAACAATC TCTCAGCATC TTCTAGCCCT 
TCATATGTCA CTCCAAGATG CTGTAAAGGA 
TGAACAAATA CAAAATGAAA TATGAGGCTG 
TATGGTTAGT GTTGTTGGTG AAACGGGATG 
TTTAGTAGCT AATCTTCCTC ATTTTCCATA 
ATCTGTAAGT TCCTTACCAT ATCATTATAT 
TAT AGT TATA AATTGCATGC TTTTTCTTTT 
CCTGTTTCGA TGAATTTTCT TTTGTTTCTC 
TATCTGTCTT TCTAAGTGAA CTTCTGCTGA 
GCACTACAAA CAGACGTAGT TGGCAGAAAT 
CAGGTCAGAA TGCTTTCATT CAGTTTCCTG 
AAAATAC CTG GCACAGTCTA GATTTTGCCT 
AGCCTCAGTC ATACAAAGGA TCACTGTGCT 
TCAATAAGTG TTAAAAATAC TTTTAGAAAG 
CTTCAAATAC TGAAAGGGAG CCTGCAGACA 
CTTAGCTAAT AACATCTTTT GTCCCTTTCT 
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14 92 01 GCGCAGGTTC AGAACAATGG 

1492 51 CTCTTGACTT TTTCAAACAT 

1493 01 TTGTTCTCCC CTCTCAGTAT 
14 93 51 AGCCAAAGGC AACACTGCAG 

14 94 01 TTGATCTACT GGAGTAACAT 

1494 51 CAAAGGAAAA CCTTATGCAG 
14 95 01 TAAGCAGGC C CACACCAAAT 
14 9551 TTTTAAATGT CATTGGAAGA 
14 9 601 ACTTGAGCAC TCACCAGTGA 
14 9651 CTCCAGAAAA TTCTCATTTT 
14 97 01 CAGAAAGACC CTCCACTGTA 
14 97 51 AAATCTACAA ACCCACAAGG 
149 8 01 ACAACCCCTT CTTTTGAGTA 

14 98 51 GTTCCTAAAC TCTGTGCAAA 
1499 01 CAGCCCCAGT GCAAACATGG 
149951 TGAACTGGCA TCTGCTAAGG 

15 00 01 CAATCCAACT ACTGAGCTAC 
15 0051 GAAAGGAGAG ATCTTCTGTC 
15 0101 GGCTTTTGAA AAATGTTCCA 
150151 AAGGTGCACA TGCCCAGAGG 
15 0201 TGTTTACCTG CTTTCTGACA 
15 02 51 TTTGGCAGTT TCCACCTGCA 
15 0301 TGAGTACAAG AGCCGATTTG 
15 03 51 TATATGAGAA TTCTGCTTGC 
1504 01 TGTACACATG GGCTCAAAAG 
15 0451 AGTCCCATGA CATTACTGAA 
150501 ATTGCATTTA TTACTGCAAT 
15 05 51 CTGAAGGAGC CAGAAACGTC 
15 0601 AGAACAGATG AGCGCCTATC 

15 0651 TAACCAAGTC TAATGATCAT 
15 07 01 ACCACAATGA TCATCTAGTT 
15 07 51 ACCACTAGAC CAGGCTGTCC 

15 08 01 TCCAGGGATG GGGCATCCAC 
15 08 51 CAT GAT C AAG CTACAGTCAT 
15 09 01 TCTCTGTTTG CCTTCCTCAA 
15 0951 TCCTGTGATA CCTTCACCTT 
1510 01 TAGGTAGAAT TGCAAAATTT 
151051 TGATGGGACT ATTCTCTATC 
151101 TTGAGATTTC TCTCTTTTTT 
151151 TCTTTCTTTC TTCTTTTTCT 
1512 01 TTCCTTCATT CTTTTCTTTT 

1512 51 GTTATTCAGG AACATGTTCT 

1513 01 AATCAGAACA ACAAAACTAC 

1513 51 CAACAACAAC AACAACAGAA 
151401 CTAT CCATGC TTGATACATT 

1514 51 TTTAGTGGAA TAAATT GGCA 

1515 01 TAAATCTTAA AGAAATGGGA 
151551 AGACTGGCAG GCATCTTTTG 

1516 01 AATTTGGAAG TCAAACTCTG 
151651 ATGCAGCCAC TGTGGTAAAA 

1517 01 TTTTTTCTTT TTGTGAGTAA 
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W gene exon 1 
AAGCTTTAAA TAAAGCAAAC ACAAGCTTTG 
GAGTGTCAGG AAGATGACAA CAAGAACATT 
TTCATCTGCC CTGGCTACTG TGTATCTGGG 
AT CAGATGGC AAAGGTGAGT CTGAGAAGAG 
TCTCTATGAT AGAAATTTAG C ATGATG CAT 
GTCAAAAGAT ACAGTCTACA GTAGCTTCTG 
GGGAGCAGTG GCATTAGTGA CATTGTCTCC 
AAGAAGAGCT CTTAATCCCA AAGCT CAATA 
GAGGGAGACT CGGATTCTCC ACCTGCCTTG 
CTGTCTCATC TCTCTGGAAA TGGCC CTATG 
TCT CTAGTAA GTTTTGTGCT TTGCTAACAT 
T C AG AGAAGA AACACAGTCA GGGTGATAAT 
CGTACTTTAC AAGAAAAATC AC CTACTGAA 
GTTCATAGCT TCAAAAGGCA GCAATGAGAA 
TTACCTAGCA TGTACTGCGG GAGGGGCTGA 
CAGAAGAAGC TGCTCCACCT GCTAAGGCAG 
AGGACAAATG AGGAC C AG C A GGGTCACAAA 
AGGAAGAAGG GAAAACAAAA CAAAACAGAA 
AGGTTAGATG TACACCTCTG TAGCCTGGGT 
AAGGCATTTA GGGTATCAAT TTGCTCCCAG 
TGTACCAGGC TCTCCATTTT ACACCCATGC 
GATAACTGGC CCGTCCCAGG TATTACCCTA 
AAGCAGGCAA GTTCCTCTAG AAATACCAGA 
AGCCCTCATC TTAGTGTGCT CAAGACATCC 
TAAAATCTGT CTTTGTCTCT CTTCATCACC 
AGTTTTTACT GAAACAGCAA ATTTTCTATC 
TTCCACTGCA GGTACTCTAC TTCAACGAAG 
ACCACAACCA TAAGAATGCA AGTCTATTCC 
AAATCACCGT GCCTGTTTCC AGAAGGTA TA 
CRl-d 

AGAATCATAG AATGGCCTGG GTTGAAAAGG 
TCAATCCCCC TGCTATGTGC AGGGTCACCA 
AGAGCCACAT CCAGCCTGGC CTTGAATGCC 
MAR (0.852) 

AACCTCTTTG GGCAACCTGT TCCAGTGCCT 
GCTAACACCT TCCCTTGCTT TTATTTTCTC 
ATGCAGGGTA CACAACTGAT TAGTACAGCA 
ATGCAATACT TAAGACATGC TTCCCATTTG 
AACCTCAAAT TTGCAAAATT TGAAATTTAA 
AGTGGACCTC TTGATCTTCT CCCTTCAGCT 
TTTTCTCTTT CTTTTTTTCC CCTTTCCTCT 
TTTTCTTTCT TTCTCTCTTT CAATCTTTCA 
TAACTACTAC TGTTCAATTA GTTATTGCTA 
TTGAACAGGC ACAGTCCCTA TCTCAGAACA 
TCTCCCACAC ATTGGATTGC AGCATCAACA 
AAAAACAGCC AGAGAAGTAA CTTTTACAGA 
T CAGAAATT A GTTTCTATTT CATTTGAAAG 
TGTTGGAATT TCCTAAGGTA GACCTTGCAA 
ATTATTGTAT TCCCGAGATA TTTCTTTGAC 
TTAAAATAGA CATAATTTAA GAGAGCAGAA 
AGTGAGTAAG GTAGTTTTTC CTCACTGACA 
AGTTCCTCTC CCTACTCTTT CCCATCATTC 
ATCATTTCCC TGAAGTCTGT CCACAAAACC 
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151751 CCTGTGGCAG CAAGTTTTGA TAAATGGGAA CTTGGGTCTA CATTCCACAG 

1518 01 CTACGGTGGG AAGACTAATT TTGGGGACTA CGCCAACAAA CCATTTATGT 
151851 TGCACGAACA GGAGATGGAT TGTTTCTCAT GAGTAATGCT TGTCTGAACT 

1519 01 GTAAGAATTA TGGAGCGCTC TAGGCAGGGA AAAGAAACTG TTCTAATAGC 
151951 TTAGAAATTT AGATAG CTGT TCATGCTTCT GATTTTCTTG CAGTAACAAG 
152 001 ATGAATACAA CACAGGTCCA GTTTCTTAGT CCACTAATTC ACAGCTTCAT 
152 051 TTCCTTAAGC TGGTTTGACA GTTTGAGTCC ACATTCATAT AATTCTGTTA 
152101 CATAAATATA AAGAATTTAC TGCAATTACT ACAACAAAAA GCATTTGCAA 
152151 AATATTATTA TTTAGAGGTA GGTTAAAAAA GTTAGAGGCA AACTTACCAT 
1522 01 GTAATTAACT TTCATAAATC TTATCAGGAG TCACACAGCC AGGTCTTCAT 
152251 GTATAGTTTA GCAATTACAT TCTGTCTCTC TCTCTGTATG TACTTCATTT 
152301 TGCAACCTCC ATTTAAAAGT CCTTAAACAT TCTAAACAGT TCAAGCTTTT 
152351 ACTACTTGCA TCCCAGGGCT CTTACAGTGT CTATAGCATA TCTGAAACTT 
152401 TTAGTAATTT C AC AT CATT C TTTTAATATC TGTCTGAGTT AGTAC AC AT C 
152451 TTGCATTGCA GTAAAGGCAA CACCACCTGA ATAGCAGTAG TTTACATAGA 
1525 01 GCTGCATGAG GAAAGAATTT AGAAATTTTG AACTGTTTTA CAGAAAAAAA 

W gene exon 2 

152551 AAAAATGTAT AACCCTTATT TCCTTGTCTC CAAGACAGAA ATAGGCAAAT 
152 601 CAGGTAATAT CCATGCTGGG TTTAAAGCAC TCAACTTGGA AATCAACCAA 
152651 CCCACTAAAA GTTACTTGCT TAGAAGCGTC AACCAGTTAT ATGGAGAAAA 

1527 01 GTCACTGCCT TTCAGTAAGG TAGGTAGGCC ATTT ATT CAT GTTATCCTGT 
152 751 GTGTGTCAGA CTTTATGATC TAT CTATGAC AACAAACCAT AAATTATATG 

1528 01 CTTTCAAATA TTTTCATTAC ATCTGCAAAT TGTGTAATTA TCTTTAACAT 
152 851 ACTTCCTGTG AGGTTCTTCT TGAGAATTTA GATATCATGA CTTTTATAGG 

152 901 ATGTATATTT AATTTGTGTG ATTCACAGTT GTGGCTACGC AAAAACATTT 
152951 AAATTATGTA TTTCCAAATA AAAT CAATAC TATGTTCTTT TGACAATGCT 

153 0 01 GTGCTTGTAG CCTACACAAT TTTTATG CAT TCTCTCCAAT CGGCTATAGT 
153 051 TATTTATTGG CATTCACACT GGCAGGCAAC AAACATAAGA CAGATGTCTA 

W gene exon 3 

153101 TCTTGCACTG CAGGAATACT TACAGTTAAC CAAGAAATAC TACAGTGCAG 
153151 AACCACAATC AGTTGACTTT GTGGGAGCAG CAAATGCAAT CAGAAGAGAG 

1532 01 ATCAATTCCA CGGTTGAACA CCAGACTGAA GGTAAGCTCT AGCATCTCCT 
153251 CTCCCAGTTC TGAAGGAAGC AGTTTTAGTC TTGAACAATT TCTCTGTGCC 
153 3 01 CAAAGGCAGG TAAACAATTT AACTCAGAAA GGAAAAT C AG AACAGTTTTG 

1533 51 CTGAAGTAAT CATCTGCTGG CAAGCCCTTT CTAGAATTAT CTTTCACCAT 

1534 01 TTGAAAGGGA GAGGAATGTG GTTTCCTCTA TAAATCAAGG TTGTCATGTA 
1534 51 TTTATGAATA ATCTCAAGCT AGAAGTATGC CAAATCAGCA CTCTAAATTT 
153 5 01 CCTTGTCTTA TGACTT CAGA AACTACGCCA GCATTTACTC TGAAACAGTA 
153 551 AAGCTGCACA AATATGTAAA CGTTCTTTGT TTTTCTCTAG GTAAAATAAA 

W gene exon 4 

153 601 AAGTCTGCTG CCTCCTGGAT CCATAGATTC ACTCACCAGG CTAGTCCTGG 
153 651 TAAATGCGCT CTATTTCAAA GGAAACTGGG CAACAAAGTT TGATGCTGAA 
153 701 GATACCAGGC AAAGGCCTTT CAGAATAAAT ACGGTATGGT AACATACTGC 
153 751 CTTATATACC AGACTGCAGG TTGAAAAAGC AGTGAAAAAG ATGGAGGAGA 
153 8 01 TAAATTCCTG TCATTCTTTA AAGCCACATA GCACTAAAAT TAGTATATTT 
1538 51 AAAACATACG TTATATCCTT CTT AG C AC AT CTTCAGTACA AAG AC CGCAT 

153 901 ACATATGCTA GCACCCAAGG CACAAATAAA ATTATCAGAA GCCAGCTTGA 
153951 AACAAACTTC CATACACCTC TTAAAGCAGG AAAAACATAG ATGTGAATAG 
1540 01 AACTGTATGA ACTAGTT CT A TAT AT TTT C A TTTTTAACCA TACAATGAAT 

154 051 TGGAGTGGAA CAGAGCTTCC AGTAAATACG TGTCATCCTA GCTGGCTAAG 
154101 ATAACCTTCC CAGCCTCCCA GTGCATTCCC AGAAGAGAGG GGCCCTCTGT 
154151 AGATCCTACA GCTTCTCTTA GAGCCACAGG GATGTACCTC CATGCTACTT 
1542 01 CAATGTAGTC TTTACTGTTC TGAGTATAAA TAGCAAGCTT TTCATTTGAT 
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W gene exon 5 

154251 TTGTTGCAGC ATACAACTAA ACCAGTGCCA ATAATGCACC TGAGTGATAA 
1543 01 ATTTAATTGG ACCTACATAG AATCAGCCCA GATTGATGTT CTTGAGCTTC 

1543 51 CATATGTCAA TAATGAACTC AGTATGTTCA TCCTGCTACC ACGGGATATC 

1544 01 ACTGGCCTAC AAAAGGTAAA GGGTAACTTT AAACTCAAAT TGCGTGAGAA 
154451 ACAACGTTTT CATGCATATC CATGGCAAAG CAATCCTGTT TCTAGGAAGG 
154501 AAGGTATCGA TAAGG CTAAA GGAAAAACAA ACCCCAAACT TGCCCAAATG 
154 551 TTATGAAGCT GAACCTTTTC AATGTTTTGT TTGGTTTTCT TTTTAACTCC 
154 6 01 TGGCACGTGG CACCTCGTGC TTCCTCATGT TGATCAGTGC TGGAAATAAG 
154 651 TAGCCCGAAT CCAACAAGAT AGATCTAATT CCAGCTGAAG AACAACGAGG 
154 701 ACAGAAAGAT AGTTCTGCTG ACTGT CTGTA CTGATTCGGA CAGATATTAT 
154 751 TACATTAAAA AGAAAAGCAC AAACTGGACA CCCTCCACTA CTTTCTGTGA 
154 8 01 TGTTTAGAGC TAATATACAT GTACACTGCC ACCTTCTGTA AACACACTGA 
154 851 ACCTGACTTC AGATAGTGAA CTACTGTGAA ATTCTCATTT ACATTAGTGG 

154 901 GTGTTTTGTA GAAAAAAAAA AAAAAAGTTA TTTTCACTAA ATTCTAAGAC 
154951 ACACAGAAAA CAGAAATGTG AGCAGCAAGT CAAATAGACT ATTGTTACTT 
1550 01 GACAGTGACG TTGTTTTACA AATATTTAAT CCCTCTATAT TCCCTGATGA 

155 051 TTACTAAGAA CAGTT CAAAT ACTGCACTAA CATGCTGTAG AGCAAAACAC 
155101 TCCTTCCTAG TAGAAAATAT TTCAGAGTTG GCATTTCACT AATGGTTTCT 
155151 GTACTTGAAA AGTACAATTT TTTTTGCTAC AAAAAAAAGC TACAGAATTT 
155201 TTGTAGTTTG AAAAGTTCTT AAATAAGAAT ATAAAAGAAA TAACCCCTAG 
155251 GGAACAGTTT TTTGAACACT CTGTAATTTT CTGGTTCTCT TTTCAATTAA 

W gene exon 6 

155301 CTGCAGCTAA TAAATGAATT GACTTTCGAA AAATTGTCTG CATGGACCAG 
155351 TCCAGAATTA ATGGAGAAAA TGAAAATGGA AGTGTATCTG CCCAGGTTCA 
1554 01 CAGTAGAAGA GAAATACGAC CTGAAATCTA CTTTGAGCAA GATGGGAATA 
155451 GAAGACGCTT TCACTGAAGG TCAAGCTGAT TTCAGGGGAA TGTCAGAGAA 
155501 CGCTGACCTG TTTTTGTCAC AGGTTTTTCA CAAGTGTTAT GTGGAAGTCA 
155551 AT GAAGAAGG CACAGAGGCA GCGGCTGCCA GTTCAGCATC TCTAGCGTCA 
155601 CGAACCCTTG GTGCTACAGT TATTTTTGTA GCAGATCACC CTTTTCTCTT 
155651 CATTATCAGA CACAACAAGA CCAAGTGCAT CCTTTTCTTG GGAAGGTTCT 
155701 GCTCCCCCTA GAAAATCAGC TATTAATAAA CAAGCCCTTA CAACAACGAT 
155751 GAACACAATG TATGC CATGA AGAACACCTT GACAGACTTT GCACTTTACC 
1558 01 ATTTTCCTGT ACTATTGACA ATCTCTTTTA GAAGAGAGCT CAAATTAAAA 
155 851 ACATGAATTC AAACCTCTGA TTCCTTTTCC TCTGCAAAGA AT CCTAGCAT 

155 901 CGTATACTGC ACTGTAGAAC ACTGAACTGC ACGCTGAACA ACATGGATGT 
155951 GTCTTTTCAG TGCTGTCCAA ACCAGAACTG CTACAATGCA GAACAGACTA 

156 0 01 GGCTGATCTA AACAGTACCT TCTGACCCAG TTCCTTTCAC ACGTAAGAAG 
156 051 AAAAGAAACA GGAGAAACTC ATTCCTGCAT ACAGCTGTTT CATCTCTTCA 
156101 AAGCCAGCTG TCCCAGGCCA GCTCAATCAC AGCCTTGTCA GTTTTAAATC 
156151 AGCTTCACAA CAT AG C ATGG CTGGTAATGA AACAAAAGTG CAAAAT CCTC 

1562 01 TGTGTTGCTG ATACTGGTGG TTTGCTCTTG CACACAAAGG AGCTAACACA 
156251 TGTACTTTCT AATCTCTGTC CCTCATAAAC TAGCAAATAC CAAACAATAC 
156301 AGAACCAGAG TAAAGTAAAA TACATACCTT GAAATGCTTT CTTTTGTCAT 

1563 51 AACCTTTAAT TCATTCAACG CTGTTGCAGC CCAGCACTGC ACTGCTTTAC 
156401 TTGCCTTTTA CTTTGCCACA TATTTTGCTG CTTGGAGCAA GTGGGAGAAT 
156451 AAAGTCTGTT ATGTTAACTC CCTAAGTGCT GTCTAAAAGA TTACATGCAA 
1565 01 ATTCTCCTCT ACATATTCAC TGCTTTCACA GCTTTTACTC CTAAAGGGGA 
156551 GGAATTCCTA ATCAGTCATG CACATCTAAG AACACAGGTG ATGCTCCTGT 
156601 TTCTCTGAAT TCAGAACAGG GAGGAAAGGA CTGGGTCTCT TAACAGCACT 

MAR- like element 
156651 TGCACACACA CTGACAGCAT CTCACTAGAA ACATCCCTTC CCAGAAAGGT 
156701 AGGATACCTT TTTCCTGGCA GAGGGAAGAG CGCTGACTGA TAGTGAGTCC 
156751 TTTCTGTATT ATTCCACGTG ACCAACTGTG GCCAGGCTCC CTTTTGGCTC 
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1568 01 TGCTTCCCAA ATGGGAAGGA ACGTAGGGAA GGGCCAATGG CAACCAAATT 
156 8 51 AGAGGGTGAG TCTTGCATTG AGGAACACCA TTTTCCCACC GTAAGTAGCA 

1569 01 CAGCACTGGG GCAGACTGCC CAGAAAAAAA TTGAGGATTT CCCATTCTTC 
156951 AAAGGGCTGT AGCTGACCTA ATTTACCAGT GGGTCTGCTC CAGGCATGAG 

1570 01 CTGGACTATG GAAATACCCA TAAACCAGCT TGTGTCTTGT TTCTATCAAC 
157051 ATCCATTCTA CCTTACCACC TCAATTTCTC ATCCTCTCTG GCACCCTTAC 

MAR- like element 
157101 AGCTTGACAA GCAGGGGCAG TTAGCTTGCT TGCTTGCTCT CAAGCATATT 
157151 TCTTTGAGAC TTGGAATCTC CCTAGCTTGT ACTTTCCCAT CAATCAATCA 
157201 CTCAGTGGAC CTTTCTTCTA CCCTGATTTT TTGACAACTC CTCTCATTGT 
157251 ACTAAAGTCA CCACTTGTTT AGTTTCTGGC ATTTCTTCAT TGCCTTGCAT 
1573 01 AATACTGATT TCTATAACTT TTCAGAAGAC TTGACCCGCT CTGTTCTTGT 

1573 51 AATCAAGCTT ACAGCAAAAT GCTTGGCACA CCATACAGTT TCCTTCCCTT 

1574 01 CCTCCTCCTT CCTTCCCTGT CCCCTAGACT CTTACAGATA TTGCCACTAC 
15 7451 CTTCCCCTTC TCTTACCACC AAAGAGCTGC TCTGCTGTCT TCTGGGACAG 

1575 01 CAGAGTGACA ACTATTGTGA CATATTTATC CCTGTTGTAT TGTTTTCACT 
157551 CCCTCCTTGC TCAAGCTCTG CCAGTGGGGC CGTTTCAGTT CTTGCCCACC 

1576 01 TCCATGGCAT GCAAGTACTG- CACACTAACT CAGTACTCTG CAGTGCTTCT 

1576 51 CTGAAGCGTT TCCCAGCCAT GGATGCAATG AGATTCTTTC ACCAACACAA 
15 77 01 GAAGCAAGGT ATTCAACCTT ACACCTTCAA TGGCTTTGCC TCTGCCTATG 

1577 51 CTCCTAATGA TCTTCACTCT GAAGCTTGAC TCTCAGAGTC TCATTCAGTC 

1578 01 AGTAAGCTCC TGACATTTTA TGCTGATGCA TCTTACTCAT AG AAT CAT AG 
1578 51 AATGGCTTGG GTTAGAAGAT CATCTAGTCC CAAGCCTCCT GCCAACCACT 
15 79 01 AAATCAGGCA CTACATCAGG CTGCC CAGGG CCCCATCCAG CCTGGCCTCG 
15 79 51 AACATCTCCA GATCTCCAGG GATGGGGCAT CCACAGCTTC TCTGGGGAGA 
158 0 01 GGTTCCAGCA CTTCACCACC TTCTCAGTGA AAACTTTCCT CAACATCTAA 
158 051 TCTAACTCTA CCTTATTTTA GTTTTAAAAC ATTCCCCCTT AACCTATCAC 
158101 TACCTTCCCA TGTGAAAAGT TGATTTCCCT CCTTACTTAC AAGCTTCTTC 
158151 TAGGTACTTG AAGGCTGCTA TCAGGTCTCC CCTGATCCTT CACTTCTTCA 

1582 01 G AC TG AG C AA GCCAAGCTTC CTCAGCCGAT CTTCAGAGAA GAGGTCTTCC 
158251 AGTTCTCTAA CCATCTTCAT GGCCCTGCTC TGGACTTGTT CCAAAGGTCC 

1583 01 ACATCTTTCC TGTGCTGGGT GCCTCAGACC TGGACACAGT ACTCCAGATG 

1583 51 GGGCCTCATT TGGGCAGAAC AGAGGGGGGA CAATCACCTC CCTCTCCCTG 

1584 01 CTGGTCACCC TTCTTTTGAT GCAGCCCGGG ATATTGTTGG CCTTTCAGGC 

1584 51 TT CAAGAGCA CGCTGCTGGC TCATGTTAAG TTCTTCATCT GCCAGGATCC 

1585 01 CTAAGTCCTT CTCAGTAGGG CTACTCTCGA TGAATTCTTA TCCTAGTATG 
158551 GAT ACATAT C TGGGATTGTC TCAACACAAG TGCAACACCT AGCACTTGGC 

1586 01 CTTGTTGAAC CTCATTATGT TCACATGGAC CCAATCCTCA AGCCTGCCCA 
158651 GGT CGCTATT AAAAAAAACA AGTGT CATGG CTTCACAAGG CTGGAAAGTT 
158 7 01 GGATCAGGCC TACAATACCT GCTAACATCC AGAAAC C AAA ACCATGCATT 
158751 CTGGCTCTGT AATCATTTTA CTAGATTTAT TTAGTTTAAA CAACAGGCCA 
1588 01 GTTGTCTTCA CAACAACAGA AGATCACTGA AATGAGTGAG TGACTTGTTT 
158 8 51 TACTGTCCTC TCAGTAGACA ATTGTGGTGT AACAGTTAAA ATGAGATTAT 

158 9 01 GATCCACATT GTTCCTTTGA AACGCCTAAA ACTAAAACAT AACATGATGA 
158951 ACAAGGAAGA CAAAA.CTGTA TGAGATCTTT TTGTGATTCA TTAGAAGCTT 
1590 01 TGAGTAGGCG GGAAC AGTGT TGACATAGGA GAGAAGGAAA AGGAAGTGCA 

159 0 51 AACTGTACTA TATTTCTAAA TTTATTCACT G C ATAACAC A CAGGCACAGA 

MENT exon L 

159101 ACCTGACTGA GGACAAGACT CAGGTCTTCT CTCTCACGGG ACCACAGGTA 
159151 AACATTTAAA CCAATTTAAA TAACTTTTTT GATGTTTTTA AATGGTT AT C 
1592 01 TATAG CTTGT ATGACAATGT AAGTATATTA AAACACAC C A GAGTTATTCT 

1592 51 GTAGTCGGGA GCATAATTGA TCACAAGAAG GAAAATCTTG TCAGGACAGT 

1593 01 AGCTGTCTTA CTAATTAAAA TGTT CAGTTT GATAAAGGAG TCTCATACTT 
1593 51 CAGGTAAAGC AAAGGCCATT TTCATTCTGC CTTGTATGAG GTCAGGCCAG 
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159401 GGACTCAGAG GAGCAGAGTA AAACAGACAG ATTTCTATCC TGATGCTCAT 

15 9451 TTGGT CAGGT TCTCCAAGGA GAGGAAGT C A CTCTGTTGGT ACAGATTTGG 

159501 TGTAGACTGG ATAACGACTG CCAGAAAAAC TGAAGTGGTT TGGTAACCAA 

159551 AATCTTGATA AATAT CTGTG GACTT CAGAG ATTGTCCTGC AATTTCTGCA 

159601 GTGCCATTCA ATAAATATAA ATCTTTCTTT ACATAATAAT AACTACTACA 

159651 ACAACAACAT TTTCCAGTCC CTCTATCAGA AAACAAC AT C AAGAAGGCAC 

1597 01 TACTGAACAG GTAAGTTAAA GTTTGGAATG CTCATAGCTT ATATGCATAG 
159751 GTATTTGCCA GTTTCTGGGG AAAATAAAAT TGCAAGAATA TAAAGAAGAG 

1598 01 ATTGTAGTTA GACTTCGTGA ATAAAATGGT AACACTCTAA AAGCAAATAA 
159851 CAACTTTGCC ATACATTATA TTAT CTGAAA TGGGTGACTA GCCAGAAAAA 

1599 01 TTCCATAAGC CTAAGAGTTA C AC CTAAATA CATTCTCAGT ATCAGCTCCT 

15 9951 AATTCTATCT AGATC CAAAA TGAGGTAGTG AAAAGTT CAA ATGTCCCATG 

1600 01 TACAAAAAAC TACTTAAACT TCCCTAGGAA CATTACTTTG ATAATGAGTT 
160 051 AAGAATGAAA ATGAACAAAA TATGCAGCTT ACAAATCCAC ACACTTTTGA 

16 0101 AAACCAAAGG CAGAAAGAAA CACAAATAAA AGGGCAGATC TATAAAAGAG 
160151 G AC AT AT C T A T AAT CAT AG A GAAATATGAG ATGGATAACA AAAACCTAAA 
160201 AGAAACTG CT GCTCCCAGCA GGTGGCACAT GGTATGTGTA GAACATATAA 
16 02 51 CGTACAACTA GGCTATTAGT TTCAAAAGGT ACCTACGTGC TCCGTTGCAA 
160301 ATGTAACATG TAAATGTAAA ATGTAAATGC AAATGTAACT AATATGCACT 
160351 AC ATACAT CA TTTTAGACAC T C AAATACT A CAATTCTGTC TGTTGCCTCT 

MENT exon 1 

16 0401 TTCCAGGCTG TAGCAATGGA ACAGGTCTCG GCATCAATTG GCAACTTTAC 

16 0451 AGTTGATCTT TTCAACAAGC TGAATGAGAC CAACAGGGAC AAAAACATTT 

16 05 01 TCTTTTCCCC TTGGAGCATA TCATCTGCTC TCGCCCTGAC ATATCTGGCT 

16 0551 GCAAAAGGCA GTACAGCAAG AGAGATGGCA GAGGTAAGTA GCTCTGTGAA 

1606 01 GCT ATGATG C TCAACACTGC CCAGCACTGC TGTTGAGATG CCCTGCTCCG 

16 0651 TTGTCATAGG GAAAAACTAC ATTTGAGTTT GCACAAATGC ATTGCTATTG 

16 07 01 CTGAGTGCAA TGGCTGTGGA AGGGATTT CA GCCTTGTAGT G C AC AG AC AG 

160751 AAGCACTGTG ATGATGCTCA CAGGCAGGAG CAATACTATT CCTTGTTACT 

16 08 01 GTAGGGGATT TACATATACT AGAGCTCCAG TGTCCCTCTG ATTAGATCAG 

16 0851 AAGATAGCAC AGTGTGTTAT CAT AAGGAT C CAAACAAGAC AACCATTTTA 

16 0901 TCTCTTTTAG GTTTTAGGTC ATGCAAACTC TTTCATGT CA GTTTCTTACC 

MENT exon 2 

16 0951 TTTGGAAACC CTGTTTGCAG GTTCTTCATT TCACTGAAGC TGTGCGAGCT 

1610 01 GAAAGCTCTT CTGTGGCCAG ACCTTCTCGG GGGAGACCAA AAAGAAGAAG 

161051 AATGGTATCT ATTAAACATG AAATCCCAAG ATAAGAGTTC AAATGT CTGG 

161101 ATATAGTTTT TAAGAGT CCA CCATTTCTTG TTTGCAGCTC TCTTTATGTT 

161151 TAAAGTATAA AACCCAATAT ACTTCGCATC ACATCCAATT TCAGTTCCCT 

1612 01 TCACTCATTC AGACT CAAAA GTATAGAAGC ACAAGTCACT GGTATAATCT 

1612 51 GAAAGGATTG CAATATGGTA AATCAGTTAA T C AAAT CAT A AAAGGAGCCC 

1613 01 TGCAAACTGC AGTGGTGGTG AATTTGGAAA GATAAAAAGT AAGAGAGAGG 
161351 AACAGAAATT CTTCCCCCAC ATCTACCCCT TAGCGTTTCA AAAACTTCAT 

1614 01 GCCAAAAATG CAACTGGTAA ATGTACAGTT TCTCTTTCCA AGGACCCTGA 

MENT exon 3 

1614 51 GCAT GAGCAA GCTGAAAACA TCCACTCTGG ATTCAAAGAG CTCCTGACAG 

161501 CCTTCAACAA ACCCAGAAAC AACT ACT CGC TGAGAAGTGC CAACCGTATC 

161551 TATGTGGAAA AAACCTACGC ATTGCTGCCT GTAAGTTGAA TGGTTTTATG 

1616 01 TCAAAGAAGA AAAAGAAAAA AAAAGAAAAG AAAAGAAAAG AAAAAAAGAA 
161651 AAAAAAGTTT TGCATTGTAT TCACTTACCA TTAATAGAAC AGATCTGAAG 

1617 01 CTGTCCATAA ATGCTGCAAA TGATGAGTCT TGGCTTCCAG TGATAAACTT 
161751 CAT TGGAAAA TACAATTTGG TCTTCTCCCA GTATATAAGA ATGCACTTGG 

1618 01 CTGTAATGCA GGACTCCTTT TCATGTAATA CAGCTTTATC ACTAGGAACC 

1618 51 TCAGTACATA CAATTGAAAA TGAGATATTA AAATACACAT ATCCAGGGGA 

1619 01 TTTGCACAGT CTTCCTTCCT TCTCCAAATA AAAATGGGAA CGAGAGAATA 
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161951 AGAGTATTTC CTTTGGTTAT TTCCTAACCA TTAACTCACT GCTCAATAGA 
162 001 GAGCAAAATG CTAGATCCTG CAATTGCCTG TGTGCAAAAA GTTAACAAGA 
162 051 AGTCCAGTAG CTAACAAATT ACTTTTTGGA CTATAAAAAT ACTGTACAAT 
162101 ACAGAATGTT TCCTTCTTTC GTTCTTTTTG TATGCCATTT TCCAGACATA 

MENT exon 4 

162151 TCTACAGCTC AGTAAGAAAT ACTATAAGGC AGAGCCACAG AAGGTTAACT 

1622 01 TTAAGACAGC ACCGGAACAA TCCAGAAAGG AAATCAACAC CTGGGTGGAA 
162 251 AAACAAACCG AGAGTAAGTT GAGCTCAACT CCAACATCCT TCCTCTTCCC 
162 3 01 ACTGTTCCCT TCGGGACCCT GTTCCCACTC CTGTGACTGT GG CAT C CAGG 

1623 51 TCATGCCCTC TGGTGTGGGC AGTAGATGGC TGTCTGCTTC CAGCTGCTTG 

1624 01 CCTTGAGACT GTGGCGTTTT TTCAGGCAGG AGCCAATTGC TGTCAGCTAG 
162451 CCAGGAGAAC TGGGCAACAA ACAGCAAACA GACTAACTGG TTTATGTCAG 

1625 01 GGAAGTAATC CAGGGAGTAG GGCACTGAGG CTTGCACTTT TTCACTAAGG 
162551 AGTTGAACTG AGTGGATAAA GAAT CAACAC ATTCCCTCAC TGTGTTACAC 

MENT exon 5 

1626 01 TGGAGTAAAG CCTGACTTTT CTGATTTCAA AAGGTAAAAT AAAGAATTTG 
162 6 51 CTGAGTTCGG ATGATGTGAA AGCCACCACT AGGTTGATCT TGGTCAATGC 
162 7 01 CATTTACTTC AAGGCAGAAT GGGAAGTGAA ATTTCAGGCA GAAAAAACAT 
162751 CTATACAACC CTTCCGACTG AGCAAGGTAA GCTCCTCTGG TGTCCTCCTT 

162 8 01 AAAACAAGCA GACTGGAGAC TGCACCCACT ACCATCTTTT ATTTCATCCA 
162851 TCCTTTAGGC ATTCCTTGGT AAACAGACTC TCTGAAAAGT TGTTTACAGC 

1629 01 AAAACATGTC AGTTGTCAGC TCACCAACAT TTATGGAACA TTAAGATGCT 
162951 GCTCAGGCAA AGGATAACTA GATC CAGATG GAACACAGTT TCCAAAAATG 

163 0 01 CTAGGGT CAA TTAAAGCCTT TTTGCAAGAC TGAGGTATAA GAGCTACATT 

1630 51 GTAAAAATCA GATATTAAGA GTCCATCCTT CCTGCACAGG AACTACATGC 
163101 TATGCTATGG ACGAGTGCAG TACCCGCGCC TCTGTGCTGC ACAATCCGGC 
163151 TGTGAATACA GCTGCTAAAG TATGGATGCA GCAGCACAGC TCCACTGGAT 

1632 01 GGGTGCATGG CCGAGTGAGA C TAG AAGT AA TGTTGCCAGA GAGGAGATCA 
163 2 51 CAAAAAGGCT GCACAACATT TATCCTCTCA CACCATAGCT GTTTCATTGC 
163 3 01 TGTAATGTTG GGTGCCTGTA TGCCATGAAT GCTCCATCCC CCTAATTCTT 

1633 51 GAAGATATTT CTGACTCCCT TCCTCTCTCC TCTGTGGGTT GATGTGCATG 

1634 01 TTCTGGGGAA AAGAGAACAT CAGTTAGCTC AGTCCCCAGC AAAATACTCT 
1634 51 GGGAAAAGAG CCAAGATCAG CAATATTGTC CAGTCAAGAA AAGCCTTGGA 
163 5 01 AAAAGAATGT CAAATCTCTG TTACAAAAGC TGCTTATGAA AGTTTCCTCT 
163 551 TTACAAGGAA TTCCTTTTTT CAAGGAATAA TTTTAACCGA TAAATAAATA 

MENT exon 6 



163601 
163651 


CCTTACAGAA 
TTTCCAGTTC 


CAAGTCCAAG 
TTATCATGGA 


CCCGTGAAGA 
AAAAATGAAC 

MAR 
GTATGTTCAT 


TGATGTATAT 
TTCAAAATGA 


GAGAGATACA 
TTGAGCTTCC 


163701 


ATACGTGAAA 


CGTGAACTCA 


CCTACTTCCT 


GACGACATCA 


163751 


AAGATGGTAC 


TACGGGTCTT 


GAGCAGGTAA 


AAAGTTCTGC 


TACATCCATT 


163801 


CTGTATCGCC 


ACTCAGTCAT 


CAGAACAAAA 


AGGACAGGCT 


GATGACCATA 


163851 


CGGCCCTTTC 


TTTCTTTGGC 


AGTTCATTCG 


GCAGAAGTAG 


CGCACAAAAA 


163901 


CTTGCAGCAT 


TATGTCTCAC 


ATTTGCTTTG 


CAGCCTGTTC 


TCTGGTCATC 


163951 


AGTAAAAGCA 


ATTTATATTT 


CATATTTTCA 


GCTGAATGTT AAATACGCCA 


164001 


TTTAAAAATC 


TGTTTAAATC 


ATTAAAAAAA 


AAAAGACAAT 


CATAATTAAT 


164051 


TGGTTTATCC 


TTGCAATTAT 


CAAATTCCTC 


TCATTTCTTA 


AACAACAGCT 



MENT exon 7 



164101 GGAAAGAGAA CTCACCTACG AGAGGCTGTC AGAATGGGCT GATTCAAAGA 
164151 TGATGACAGA AACTCTTGTG GATCTGCACC TGCCTAAGTT CTCACTGGAG 
164 2 01 GACAGAATTG ACCTCCGTGA TACTCTGAGA AACATGGGAA TGACAACTGC 
164 251 CTTCACAACC AATGCTGATT TCAGAGGGAT GACTGATAAG AAGGATTTGG 
1643 01 CTATTTCCAA AGTCATTCAC CAGTCTTTTG TTGCAGTTGA TGAGAAAGGC 
1643 51 ACTGAGGCAG CTGCTGCTAC AGCTGTAATT AT AT CAT T C A CAACTTCAGT 
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TATCAATCAT 
T C AG AC AT AA 
CCAGTAGAAT 

TCAAAGGAAA 
ATAGCCATAA 
CCCAGACACC 
ATTATGCTCA 
TTGCTAGCCT 
AAGACTGAGG 
ACAACAATGT 
TCAAATCTGC 
CAT CTG CACG 
TAACTTACGA 
CCAGTAACAT 
ATTTAAACAA 
GACTCACGTA 
ATTTCCATTC 
AAGTGCTGAG 
TACAACACTG 
GACACACCAA 
CCAGTCTTGT 
AATCCTCAAC 
CACCAGAAGG 
TGGGTGGATA 
GCAGCTGGGG 
TGCTGGAAAA 
CAGATTGGCC 
GTATCCTGGG 
ATTCTCCCCA 
CCAGTTCTGG 
TCCAGTGGAG 
AAAGAAAGGT 
GAGGGGTGGA 
ATGAATGGAC 
GCTAAAAACT 
TTTATTTTGA 
GGAGTCTCCT 
GCAACCTGCC 
CAAGAGGTCC 
CTGTAAGCAT 
CGCCCCATGG 
CAAGAAATAA 
AACATCCAAT 
GGCAGTAGCT 
AG C CATTATG 
ACAAGATCTG 
AAAATT C C AC 
CTATTTGGAA 
CTGTACACTG 
CCAGCTCAGG 
CAGCTGCATG 
GTATAAGTGT 
TAAAAACACA 
ACTCGAGAGG 



GTTCTGAAAT 
C AAAT C C AAA 

AAATTATTCC 
AAATGTGAAC 
GTACTGGAAG 
ACCACGCGCC 
TGAAATTGCC 
CTGTTCACCT 
TTGATAAGTA 
GCCTTATTGC 
TTCCCGTTTT 
GTCTCCTTGC 
GTGCCTTTCA 
TTTTATGGAT 
TAAATGAATT 
GCTCTTTTAC 
TTGTTATTTC 
GCACTCTAAT 
AAAACACTGT 
TTCAAATCAA 
TCAACAGCTT 
AAGTTTGCTG 
CTGTGCTGCC 
GAAACCCAAT 
AGGAATAACT 
AAACCTCTGA 
ATGAGCCAAC 
GTGAATTAAT 
TCTACTCTGC 
GCTCCTCAGT 
GACTGCCAAG 
TGAGAGACCG 
TCTCATCATT 
CCAGACTGTT 
GCAGTGCAGA 
GAGTGACAGA 
TCTCTAGAGA 
GTAGGGAACT 
CTTCCAACTT 
TCAGTTTCCC 
ACCTACTGAG 
AAGTCTGACC 
GCTGTATCAG 
GTGCTATCTT 
ATGCCCACCT 
AATGCTGGAG 
CTGTAACTAC 
ATATATCTCC 
TGCTGCTTCT 
GTTCAGCTGT 
CTACTGACTC 
GGAAAAGCAC 
AAGAAGTTAC 
ATTGCACGTA 
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TTAAGGTTGA 
ACAATCCTGT 

TCACTCCTAG 
TGTAGTATTA 
TCTATGTCTT 
TCGAAGACTG 
TTTTATAGAA 
TGCGTTTTGT 
TCCCAACCAG 
TACCTCTTAC 
CATTAACATA 
TACCTGGATC 
GCTAAGATAG 
GCTTCACTTC 
TGGAACTGTG 
AGCATCCTGT 
AAACCAACCA 
AAAC CAGGGT 
CGTTCAACGT 
AGCCTGTGAT 
CACCAATAGT 
ACAATATGAA 
ACTCACTGAG 
GAGCTTCTAC 
GCATGCATCA 
AGAGAATAAG 
AGTGTGCCCA 
AAGAG CGTGG 
ACTGGTGAGG 
TAAAGGAAGA 
TTGATTAGGG 
CAGACTATTC 
GTTTATAAAT 
TTCAGTGGTG 
ACGTTCTATA 
GCACTGGAAC 
TATTTAAGAT 
GCTTTAGCAG 
CTATGATTCT 
ATTGTTGACC 
AAAGCTCATT 
AGAGCAAATA 
CAACATCCAT 
ATGC CTATAA 
ACCACACGTC 
GGAAGTACTT 
AGCTCACTCT 
ACCCAACAAA 
AAACTCAGTG 
CTGTCACTGA 
AGCTTTGCAA 
ACAAGTGAAA 
TGTCAGGATG 
GGGGTGTCAA 



TCACCCTTTC 
TTTTTGGCAG 

AGGGATCCAA 
AAAGCTCAGC 
TTTCCTTAAG 
TCTCTCTACT 
AGCAAATGCT 
CCTTATTTCT 
CAAAAAAGAC 
TGAAATGTGA 
ATTATATGTT 
ATTGATAAGT 
TCCCGGTATT 
ATTATTTTGC 
TTTATGCTAT 
ATAATGGGTG 
TCACATCACC 
CTTACTCCCA 
GT TAT C GT AG 
AACAGAGTTA 
CTAGATAAAG 
GCTGGGAGGA 
AC C CAGAC AG 
AAGAACAAGC 
CT AC AG AT C A 
CTGGGTGTCT 
TGTGGCCCAG 
CCAGCAAGTT 
C C AC AT CTGG 
CAGGGAACTG 
ACCTGGAGCA 
AGCT CAGAGA 
AT CTAAAGTA 
TGCAGCACAA 
CAAACATGAG 
AGGCAGCCCA 
CTGCCTGGAT 
GGGTTGGACT 
GTGAT CTGAA 
ACTGCTTATT 
TTCTTCTGCA 
GCCCAGAAGA 
TTCTTCCATT 
GGAATTGGAT 
ACAGGACATT 
TAGAAATATA 
ATGCCTCAAC 
GTATATATAC 
CCCTCCCTCA 
GGACCCACTT 
AGAATTGCAC 
ACTACACAAT 
GAATAGTCAG 
ACTTCAATTA 



CACTTCTTCA 
ATTCTGCTGC 

AGTTCACTTT 

CTTCAATCAT 

TAAGG CAGC A 

GCTCCTTTCC 

TGAGGTACAA 

CTAAACTCTC 

CAAGAAAACT 

CCTAAACAAT 

TCCTGGCTAA 

GTATGATTTG 

GACAGAAACA 

CATGATCTAC 

GCAAGATTCT 

GCTGACACAT 

GCTAACGACA 

CTAGATTTCA 

ACATATACTA 

AGGCATTTGC 

GGATAGAGTG 

GTGGTTGATA 

GTTGTAGAGT 

ATAGGGTGCT 

GAGGCTAGCC 

TGGTAGACAA 

GTGGCCAATG 

GAGAGAGGTG 

AGTACTATGT 

CTGAGGAGAG 

TCTCCCATAC 

AGAAAAGACT 

CAGGAGTCAA 

GGGGGCAATG 

GAAGTACTTC 

GAGAGGTTGT 

GCTTTCCTGT 

GGATGATCTC 

CTTGCTTTCA 

GCACTATCAG 

TTATGATACA 

GTGGCTTCCA 

AG CTTAGGAG 

AAAGTTTTGT 

ATTTCAAGCA 

TACTC CTAGC 

AAAAGGTAGA 

GCCGCCTTCT 

CCTGTAACCC 

TGGTTCAGCC 

ATTGTAAATT 

ATAAGTAAAG 

CATAGCAAGG 

AAAAAAAGTT 
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167101 AAGAATTGGA ATTCTCCAGC TTGTTCAGGT CCTTCTGAAT GGCAGCAAAA 
167151 CCCTCTGGTG TATCATCCAC CCCTCCAAAT TTTGTATTGT CTGCAAATTT 

1672 01 GGTGAAGGTG AACTCTATCT CAGCATCCTG GTCATTAATG AATGTGTAAA 
167251 TAGTACTGAC ACTGGTGTTA ACCGCTGGAG TACACCACTA GTGACTTCAC 

1673 01 CTTCAGCTGG ACTTCATACC ACTGATCACA GCTCTCTGGG CCTGGCTGGG 
167351 CTCACTTATC TAACCCATCG TTTGTCAGCC TGGCCATAAG GAT AT CACAG 

1674 01 GAAAAAGTGT TAAATGCTGT TTCTAAAGTT AAGGCAAACA ACATTTAGTA 
167451 TTCTCCCCTC ATCAAAGAGA CTAGCTGCCT CATGGTAGAA GGCAATCAGG 
167501 CTGGCTAAGC ATGACTTCCC CTT CATAT AT CAATCCTGAC TACTCTCAAT 
167551 CTCCTTCTTA CTCTTCATGA GTTTCACAGT AGTGTCCAAC TCTGTGGTAG 
1676 01 CCTGACCTCG CAGCAAGCAG AAATGGCAGA GAGAGTACAG AATAATAAAG 
167 651 ACAAGGAATG TTACAGTGCT GGTGTATTTT TAGTACTCAA GGCTTGGATG 
16 7701 CAGAAAAAAA C C AG CAACAA AACAACCTAT TTTAGAACTA AAATG CAAAT 
167751 GTACCAATAT TTTTTAAGGT TTTTATTTCA CCAT CAGTAA CTGGAACAAG 
167 8 01 AAAATCCCTT TTAAAATATT ATTTATACTC CAGTCACTTA CAGGTCTATT 
167851 GCAACCACTG TGATAC TTAT TCTCCTTTCC TCTCTTGTAG GACAAGGCTG 

167 901 AATGGGACTT CCCACGGAGT AAAAAACTGT AACTGGACTT AATCTAGCAG 
1679 51 TGATTTTAAA TTGTGCTGCT TGTTAACCAG GCATAAATGC CTAGATGTAC 

168 0 01 CAAACTATAG AGTAAACTGA TCACAGGTAT AGGAGTACAA CAGCAGAATA 
168 051 GAAAGATACA GCCTCACATC CACACTGAGA AAGGTGAGGA TGAAAGATGA 
168101 ATTCATGAGA AAGCCCGAGA AAAGTGACTT GCTTTATCAG AAAAATTTTA 
168151 TGACCATTTG TCAGAGAGCA ACCTATTTGT GGCCAGCAGG GGTTTTGATG 
16 82 01 CTGCAACACA TTGTTT CTAC CATTCTGTTT GCAGTAAGCC ACTCTTTCCT 
1682 51 CTTCTCCACG TGGCACAGCA AAACCAAGAG CACCTTTTCT TTCCCATGTT 
16 83 01 TTCTCCCCTT TGAAACTCTA CTCCCTCTTA CTAAACAGGA TCTTGCAGCA 
16 83 51 GATACCATAG AGTGTC C ACA CAGTTTGGGC ACCCTTTTGT AATTTTCCTG 
1684 01 TATTTTTGCC CAATCTTTGT AGTTTGTAGT ACAGTAATTA TTGCTATTAA 
16 8451 CTGCTTTCTT ACCTCTCCTA AATATTCAGC CCAACAGGTA ACAGCAGTTT 
16 85 01 CAGATATGTA GCAGGAACAT TTCTTAACCA CATAACTAAA CATAATAGCA 
16 8551 TTCGATAGAT TTCTGAAGAA AGAAAATGAG ATTGATATTC CAGGTAGGAA 
168 601 AAATGTG C AC ACTTAGAGAG AGAGCATGTG AAAAAAGATA GGTATACATG 
168651 TCACTCCTGA AACCTTGTCC TGTGTATGTA TCAGTCACAT TCAAAGTGAA 
168701 TAGCACCATA AAAATTTAGT TCTAGTTCTA GATATGGTTT TGTACTCCTC 
168 751 ATACAAATGC TTTTGCACAT AGGTTGACTG CATGCTGATA ACTGGTTTAC 
168 8 01 TTTCCAGTGT CATAACTGTG AGCAGAGGGC ACTC CTAAGG CAATTCTGAG 
168851 TAACCAAACA TGTACTAGAA AATATATACA ATGATTGCAA AGGAAACTGA 
168 901 TCAATGCATT TCATCCAAGC CACGCAGATA AAAACTGGAA TTGCTAATTA 

168 951 CATTGGTTAA AATAAAATGC CTGTGTACAG ATAACTGTCT TGCATTATAT 

169 0 01 CTTTACTGAT CTGATTCCAT ATAACATGCT ACTTCGACTC TCTACTACAA 
169051 AACAACAAGC AGTTTCAGTA AAT C AAAT AA TTACAAGCAG AACTTCCAAG 
169101 CAGCTTGGCA GCTTTTCGTG AGTGGCCCAA ACTCCCTCCC TTCATAGAAG 
169151 ATAATAAGCT TGGCACAGTG CAGGGCCTGT AGGTTAT CTA AAGCATCTTC 

1692 01 ACCTCCTCTG CTAGGAGGAA ATGCATTCTC TTAAAGCACC ACCCACCAGT 
169251 TCGAATAAGT ACCATGCTCA CGTAGACTTC ATCACTTGCT TTAGGTGACT 

1693 01 GTGGGAATAC AGTCTGAGTC TCCTTGAGGA TCTGACTGCT GAGACATAAA 

1693 51 AGGATTAAGG GTAAAATATC ACAACATAAG GGCTTTGTGT GACCTGACAA 
169401 CCACAAGTAT TCAAATCTTT CCACTCTGAA ATTTGACAGC AGGATGAATG 

1694 51 ACACTAACAG CCTACTTCTG TACTCCTTTT C AGCTAAAT C TTTGAGATTA 

1695 01 GTTTCTCTTA ATTCCCTAGT TCATAGAGTA AAAAGAAAAG GAAACTGTAA 
169551 AATTATTTCA ACGTGTCCCT TGTCAATGCA CTGCAACTCT ATTCTTTTCT 

1696 01 GCAATTCCTA CATACTTGCA GACAAAGATT GAGGAGGGCA CCACTGGGAA 
169651 AAAGG CATAC CAGGCTTCAT GTTTCAGGAC ACACACCAAC TAACAGTGGT 
169 7 01 TTTGAATTGG CCTCTCTACC CTCATCTCTC TAATTCTGAA AG C CAGTGAC 
169751 GGAGAGAAGA AATGGGAATA AATGAGGGAA CAGGCATATT AATACAAGAA 
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1698 01 GAGGATGTTG GGAAGAGAAG 
169851 GTATTCGTTG CCTCACAGAC 

1699 01 AACTCTTCCC TCTCTCACAG 

169 951 AATAACCAGC TTTATTTTGA 

170 0 01 AATGGAGAGG GTAAATGGTA 
170 051 TCAAGTCTTC ATACAAACTG 
170101 ACAGTGTTCA GGATGCACAT 
17 0151 GAGAGCACAA GAGTTTCCTC 
170201 CTACCAGCTG ATTTAGGATC 

1702 51 TGCACTCCTA CTGATCCTGG 

1703 01 AACACAGTGT TTTAGACAAT 
17 03 51 GTTAAATAAC TACTGGATCA 

1704 01 AAAGTGGCTA TTAGACAGGC 
17 0451 AGCCCTAGAA ATGCTTGATT 

1705 01 TGG CAATAGT GGAGATACTA 
17 05 51 CTGAGCCCAG TTCTCTAGTT 
170 6 01 CTCCAGGCAC AGCAGTAAGA 
17 0651 TATGAGCTAG GCTGCAGGGC 
17 0 7 01 CCAAAGCTGT TTTGGAGCAC 
17 0 751 CCCAAAAGCT CCAGCTATCT 
17 08 01 CTGCTGCTTT TGATTT CAGC 
17 0 851 ATTCCACTCA TTCTCATTGA 
17 09 01 CTGCACCGAC AATGGAAATT 
17 0951 ACTAAATTCA TCTAAAATAT 
1710 01 CCAGCTATGA AAAATTACTT 
1710 51 ATCTTTGATT TTGTCCCATA 
171101 TGCAGGTTCT GATAAAC CAG 
171151 ATGGGAACTG TTTGGAGTTG 
1712 01 CAGTCATTAA CAAAT C CAGC 

1712 51 AAAGAT CAAA TAACAATCTA 

1713 01 GCAAATGGCA TGAATTCGCA 

1713 51 TAAGGAAGTA GGAATAAAAG 

1714 01 TTTCGCTCCC TCAGAATGGA 

1714 51 TTTTTAAGAA CAAGTTTGGT 

1715 01 CTGATGATGA AT AAG CGAAA 
171551 GTGATGGGAA AGGGAGAAAT 

1716 01 TGACTGTACG TGGCGTTTTG 

1716 51 TACAAGGGTG CAGGCAATCT 

1717 01 TTGTGAAGAT GCAGTCATCA 

1717 51 GTATTGATGG AGTGTCTTTC 

1718 01 TTACAAAAAG CTGGATGTAA 

1718 51 CTTGGAGTAT TGCAACTGCT 

1719 01 GACACAGCAA CCCAGATGGC 
171951 ACTTAAAGCT GGCTCCCACG 
172 0 01 ACAAAAAGGA GCAACATGAG 
172 051 AAGAAAAAAC ACTGCATGCA 
172101 TTCCAAGAGC CCCCTTCTCT 
172151 TTCTGGCAGA GTGAAGCTAT 
1722 01 CACCCAGTTA GTAAGTCCTC 

1722 51 AGCAACATCT TTGCTTGTGA 

1723 01 CCATGAAGAA GGGAAGGTTT 
172 3 51 GAGAATGGTG AGGTG CTGGA 

1724 01 CTTCC CTGGA GGTGTTCAAG 
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GGAGGTGCAA CCATTTCTAC ATTTCCAAAA 
CGCCACAGAT CCTGACTGAA GAAAC AT CTT 
GATTTCAGGT AGTGTAAATA TTCTTATCTA 
CAGTCTTAAA AATTCATATA AATTTTCCAG 
ACTTAAATGT CATGCAGGGA AATGGAGCAC 
CAGC CTTTAG GAAAACACTG CACAAAGGGC 
CCAGACTGGG CAAAACCTGT GCAGCACATA 
ACTTGCAGGC AGGGTGCCCT AATCAC CAGG 
AGAGGACTGA ACACAGAACG CAGATGATAA 
AGTTGAACAC ACGTGACTGC ATGACTCCAT 
CAGCACTGTC CAAGCCACTG CTTTATTCGT 
GCACCCAGCC TTTTCCTTTC CTGAATGACC 
AGTG CTGATA CAAACACACA TGCTCTCTCA 
GCTGTGTGCA AATTCAGTTT TGACACGCTC 
GTTTGAAATT CTTTAAGCAA AAACATTCAG 
ATGCTCTGGG AGTATCTGGC TGATCAGCTC 
AATACCAGGC TTAAGGTAAA ATGCTTAAGG 
ATTCACCTCT GAAGCACACC TGAGAATGAT 
TGGGAAGCTT TGATATCCAA ATTTAAGAAT 
CTAAGACTGA ACAGCAGCTG GTTTTGAGGA 
ATGCAAATTG TGAGTGAAGC TGGACTGAAA 
CCTAAGGCAA GTCTTTCACT TGATGCAATC 
TCTGCATTTA TCCTAATATT TGTCAACAGT 
GTTAGTTAGA ACCACGAACC TATT CTGGAT 
TAACCCTGGG GTTTCTATGT TTTTGTTATT 
AAGAGGGAAG CAGAAAGAGA ATGGGAAATC 
CACGAAAGTA GAACAGAGCT GTTGCTTAAA 
TATGTAAGAC ACAAGAGCTT ACTTGAAAAA 
TACAGTTGAC AAATGCAGGC CTACGCTCAC 
CAGATCTCCA TCAGCCTTCT AAAGCATGTG 
ATAGTTCTTT CAG C AGAAAA TAATGCAGTT 
CTCAGCGCAG CACAGTGGGA AATATTTCTA 
AGGATAATTC TAAGAAACTT AAGTAAATGC 
TTGGGAAAAC GTTCCTAAAA TAGTGTTTGT 
CGGCTGAAAC AAGAATATAA GCATTTAACA 
AGCGTAGGTC TTAAAAAGGG ACTTGCTATC 

Zl exon 1 
GCCATTACAG AGCCATCCCA TCAGCGTGTG 
CCTACCAGCA ACTTCCCATT TTGTTAGATG 
CCTCCCTCTG CTAATCATTC GTCTCAGGCT 
AGCATCAACC AACAGCTTCA CCCTGGACCT 
CTTCCAAAGG ACAAAACATT TTCTTTGCTC 
CTCGCTATGG TCTATCTGGG TGCAAAAGGT 
TAAGGTAAGT TCTGAACTTA GCAGTGTATC 
TGACTTGCAT TATTCACCTG CCTACGTGGA 
AGAACAGTGA TCCGCCATCA TGTTGGGTGC 
GTCCTGCAGG ACACTGTTGT TCCAACTTAT 
GCTACCACCT CT AC CTTTAG CTTAAAATTG 
GGACATTAGA GACTGTCTTC TCATTCTCCA 
TGAACCCAAG CTTGGCTGCA CAGCAGCAGC 
CATGGTTAGT CCATAAATAA GGTAATACAC 
AGATTGGATG TCAGGCGGAA GTTTTTCACA 
ACAGGCTGCC CACAGAGGCT GTGGATGCCC 
GCCAGGTTGG ATGTGGCCCT GGGCAGCCCG 
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172451 GTCTAGTATT AAATGTGGAG GTTGGTGG C C CTGCCTATGG AAAGGGATTG 

1725 01 GAGCTTCATG ATCCTTGGGG TCCCTTCCAA CCCAAGCCAC TCTACGATTC 
172551 TATGAATAAT GGGGATATAC TAGAAATGAA AATATGATAT CATTTATAAC 

1726 01 CACTTTTGCA AAACTTTCGG TGATGTACTC CTAGATTATA TGCATTTACA 
172 651 TAAATGCATT TATCTGTGTA ATGTACTGTA GAGTTGTACA TTGGTGCCTC 

1727 01 AATAGTAAGA CTACAAACCA TCCTATGTTG TTTGTTCTGC CTGATAACAA 
172 751 TCTGAAAATA AATT C C AC AT TGCTAAGCAT GAATTGAC C A TTTCTCCAAA 

1728 01 TCAATCCATG TCTGAGCAAT CACATTGATC TGTTATTAAG TAGTAAATGA 

172 851 CTAAAATTAA TATAACTATG ATACGGTTAT AGAATCTAAA TCTAGACCGA 

1729 01 GGTCTTGTTC TCTATAACTT TAATAGACTA ACATTTGTAC GATGGCTAAA 
172951 TTATCCTAAG TAGAAAACTA ACATCATTAC GTAACACTAG AGCACTTCTA 

173 0 01 TCTTCACAAA ACAAACTGTC CTTAAGAAAA TTTATCACTG CCACAGGTTC 
173 0 51 TTCATTTTAA CCAGACTGCA AGAGAAGAAA GTCCTTCTGA GATGACAGCA 
173101 CCTTCTCTGC GGAGC CCAAA GAGAAGAGAC ATGGTATTTA TTTTGACAAG 
173151 GCTCACAGAA ACAAATCTCG GAGAATGGGA CAGCACAACT GCCTTAACCC 
173 2 01 TCTGAAATTT GTCTGTCTAT ACCTACTGTC CTATTGCAGT AAGAACTATA 

1732 51 CATAAAAAAT GTGAACAAGC AGGTAATATT ACTGTAAAAC TCACGAACTC 

1733 01 AGAACTTCAA AGCAGAACAG AGACACCAGA AGCATTCTGA TTGTCTTTTA 

1733 51 CACTTTGTTA CTTGCTTTTA GCTGCTCTAT GGCAATAGGA ATACCTAATA 

1734 01 TTTTACTAAC TTCAGGATTT TTTTTTTTTT CTGGGCTGTA AAACAGCCAT 
17 3451 TGCCTTCACA T CTG ACT CCA GTCCTGCTCT CTTAAACTCT GTTCCACCTT 
173 5 01 TGCTATGCAT GCACACATCA GGCAAACAAA ATTCCCCTGT AAGCTAACAC 
173 551 ACACTGATTC ACCCTTGCTC ATCAGCTCCC CAGAGACTTG ACAGCAGGAG 
1736 01 GAACTGCACA GAACTCCTTA CGCCTCAAAG GTCTGTCAAC TCAAACAAGC 
173 651 TGCAATTCTT TAGCACAGAA ACTAAGAAAG CTCAAAGAAA CCAAGTTAGA 
173 7 01 AAAACTGGAA ATAAAAAGGA GCAGTTACAT CATTCTCCAT CTAGATGGCC 
173 751 GTGGATTAAT ATTTAAGATA AATCATTATG TCCACATTGA TTTTAGCTAT 
173 8 01 TCACTGCCTT TCTAAGTTGC ACAGCAAGGT CTG C CTGATG TAGCCTTCCA 
173 851 GACATTTCTC TCCATTACCT CTTCACTATG TACCCATTGC CTTGCAAAAC 
173901 C TAT CT ACTA TCCTGTTCTC TACCAAGTTC TTCCCCACAT GTCCTTTCTG 

173 951 AAAGCCTGTA CCTCTGCCTG TGTGAAAAAA TAC C AGAGGA AGGAATGCCT 
1740 01 CCTCACCAAT CTTATGACAA GCCCCCATGC ATCAGCAGCA AAGAATCTCG 

174 051 GTGTCTCACA TGTAGCGCAT GGTACATGCC ATGGAGCAGG AAATTATACT 
174101 GAAGCAGCTT ATCCCTACAC T ACGAAAG C A ACAGCTGACA AGCAAGCTCC 
174151 TGCTCCCTAA AACCATCACC AAGGACATTC TGGACAGGTT CCTTCCAACA 
1742 01 TCCAAGCAGT AACAGCAAAA TTCACATCAA ATAGAAAATT CGAATCAACT 
1742 51 CAATTACATG ACATCAGTAT CTGT CTGAAC AGAGTAGCTC CTCAAAAGCT 
174 3 01 GCAATGTTGC CTTAATGATT TTTGTGATAA TCAAAATTAG GCTTGACTGT 
174351 GACTGGAATG AGATGACCCA ATATC CTGGG TGCACCATCT GAGGACAGCT 
1744 01 GATGTCCTCA AAGGTGTAAG CAGCTCCGCA GAATCAAATC ATAACCCCAC 
174451 AGACATAAAA CAATGTATCC AGTTACACAC AGTGCTGCAA AAGTAC CAGA 
174 501 TACCCGAAGA AGAGCT C CAT CCCTGCACAC ACTTTTTAAA TTAAAATGAC 
174551 CTGGGGATTT TAAATAACCA TAGAAAGTGT AATGCTTCAG C C AAAAAT AT 

Zl gene exon 2 

174601 TTCAATACTA ATTGGTGCCT CTTTCCAAGG GTCTTGAGTA TGAGGAAACT 
174651 GAAAACATCC ACTCCGGCTT CAAAGAACTC CTGTCTGCCA TCAGCAAACC 

1747 01 TAGAAACACT TACTTGCTGA AAAGTGCTAA CCAACTGTTT GAGGACAAAA 
174751 CCTACCCATT ACTGCCTGTG AGTTGAACGT TTCTGCTTAA GAATGTTTTG 

1748 01 ACAAAGAGTA TTTGACGTAC ATATTTGACA AT C ATAATTT TCCAAATGTT 
174 851 CTGCCTTTTA AATCTACTGC AGCCTTTAAA AC TGTAAG AG TTCTACAGTT 
174901 GAACTACAGA AAGCTCCTGT CTCTTTTGAG CCTATTTCTT GTTAG CATCA 
174951 CTGTTGGTTC ACTTCTCCTC TAAAGGAAG C TTCTCCATAT CAGTGGT CAT 
1750 01 CTTCCTCCCC CTTCTCACTA TCTATTACAT TCTAGCATTT CTGAGATTGG 
175051 AGG CCAGAGC TATATAGAAA AGTCACACCA TACCTTTACA GAGCGGCATA 
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175101 TTAACGTTCT CCCTTTTATT ACACTGTCAT TTCCTTCAAC TTTTCACATC 
175151 ATATTTGCTT GTTTGACTAT TACTGGGCAG ATTATTTTTT TTTTTCATGC 
1752 01 AACTGTCCAT TAGAACTTCA ATACCTCTTT TCTAAAGCAA TAATCATCCT 

1752 51 TATTAGTATG CTTTGCTGGA TCATTTTCAG ATGTTTCAAT ATTGATACTG 

1753 01 TAATCTATTT CCTTCTTCTC TTAGTTTCTA GCCTTTTAAA CAAGATCTTT 

1753 51 CTCTGTAGTC CCACGGATAC TCAGCTTCTT CAAAAGCCTT TGGTGAGAAG 

1754 01 TCTTCCAAAA TGCCTTTTGG CAATCCTAAT TGGCTGTAGT GGTTTGACCT 
175451 TCTCATTTGC AAGCTTCTTT ACACCTTCAA AAACCTCCAA AAGATTCATG 

1755 01 ATGTAGGACT TATTGTTACA AAATGCTGTT CCAACACATC ATGCTTACCT 
175 551 ACGTGCCCAG TAGGCCTGCT TAT C CATTTG C C CTGT AC AG ACTCATGAGC 

1756 01 TTGTAATTCT GTGTATCTTT GAAAT CTTTT TAAGAAATGG CACTGTATTA 
1756 51 ATCATCCTCC AGTTTCTTGG AAGGTGCTGT TAAATGAGGT TAGCGTTCTT 
175 7 01 TCATCTTGGA GTTAAGTTTC AGTCTCCTGA GTAATTTCTC AGTCATTAAA 
175 751 TTCAATCAGC CTTACTGAAT GCTTGTTGCT CATGTTCTGC ATAATTTTGT 
175 8 01 GTAACCGTGG TTTAAGACTA AT CAGGG AAT TTCCTGCCGG G AATCT C CAT 
175 8 51 GAACAGTTTC ATAGAAGACA GTAATGTAAA AGTCAGTTCT AGTTTATTCT 
1759 01 ACCATAACAT TCCCTTCATA TCTGACCCTC TACTGACCTC ACAAACT CTT 

175 951 GGCAGCTTGA CTCTGAAGCA AAT C TTAAAA CATTGTTTCA ACTGTCCTTG 

176 0 01 AAGCTATTCT TTCTTCTTGT ACTGAACTCT GTCATAAAAC ATAAAAATGT 
176 0 51 ACTTCAAGTC AGTCTGAGAA AT C AAAAAT A ATTTAAAAAA ATGTGTAGAA 
176101 TGTTTATCTC AT AGGATTT C AAAATTACAA ATTTGCATGT TGGATTTAAA 
176151 ACACAAAACT TTCAAGCATC ATTTTTTGTG AAACACAAAT ACTGAATTTT 
1762 01 TGATCAGTCC TTGCTTATTA TTTAACACAG AATCATAATC ACAGAATCAT 

1762 51 AGAATGGCCT GGGTTGAAAA GGACCACAAT GATCATCGAG TTTCAACCCC 

1763 01 TCTGCTATAT GCAGGGTCAA C AAC C AG C AG ACCAGGCTGC CCAGAGCCAC 
176 3 51 ATCCAGCCTG GCCTGGAATG CCTCCAGGGA TGGGCATCCA CAACCTCCTT 

1764 01 GGGCAACCTG TTCCAGTGTG TCACCACCCT CTGTGTGAAA AACTTCCTCC 
176451 TAATATCTAA CCTAAACCTC CCCTGTCTCA GTTTAAAAAC CATTCCCCCT 

1765 01 TGTCCTATCA CTGTCCACCC TCATAAACAG TCATTCCCCC TCCAGTTTAT 
176551 ACACTCCTTT CAAATATTGG AAGGCCACAA TGAGGTCTCC CTGGAGCTTT 

1766 01 CTCTTCTCCA AGCTAAACAA GTCCAGTCCC CTCAACCCTT CTTCATAGGA 
176651 GAGGTGCTCC AGCCCTCTGA TCATCTTAGC AGCCAT CAT C ACAGAATGTA 
176701 TCAGAATTTG TTTTACGGGT TATCTAGTTT CAAATGAATT ACATTTTCTT 
176 751 CCAGTATGTA TTAGTATGTA TTGCATGATC TGTTGAGATG ATCTTTTTCT 
176 8 01 ACTATTTTTG TGCTTAAATT TAACTATATA AG CATACATT TTCCAATTCT 

Zl gene exon 3 

176851 ATCCTTCAGA AATTTTTACA ACT GAT CACA AGGTACTACC AAGCAAAGCC 

1769 01 ACAAGCTGTA AACTTTAAGA CAGATGCGGA ACAAGCCAGA GCACAGATCA 

176 951 ATT CCTGGGT TGAAAACGAA ACT G AGAGT A AGTATCGCTC TGATGGCTTT 

1770 01 TTCTTTTCTC ACTTCAAAAT CATTTGCATT TCCACTTGAA TTGCTCTTGC 

177 0 51 AGTAAGGGAT C C AT AAAGG A TGGAAACTGT GGGGAAATGA TGAACAAATT 
177101 GCAGTTAAAT GTCTTGAAGA AAGCCAACCA CCAAAACTAA CTGCTGCCCC 
17 7151 TTGCAAAGTT TTTCCCTTGA TTTTTCATGT CATAGTCTCT TCTGAAGTAT 

1772 01 TTCTGTTCAT AAGGAAGCAG AGTGGATACT ACATGGCTCC ACTCTGATCA 
17 72 51 GTGAAGGTTT TACTTCTGCA AGCTTCAACT GGTTGCAGCC AACTCCAGAG 

1773 01 AACTTCCACG CTTTACACAC TTCTTAACAT CTTTT ACTAC TAAAACTGAA 
17 7351 ATAAATATGG TTTAAAAAAC AGTGATG CTT CAAAAGC C AT TTATGTATGT 
177401 ACGCTGTGAA AAATGCACAG GGAAAAAAAA TCTCTGAGTG TAAACACTTT 
17 7451 TGTTAGATAG CTAGG CATAG AGAAAGCACA TCTGAAATTG GTGAGTTGTG 
1775 01 CATTCGCAGC GAATTAACAG TCCTATCTAT TTGATTTTTA AG GGAAGATC 

Zl gene exon 4 

177551 CAGAATCTGC TACCTGCAGG ATCTCTTGAT TCTGACACTG TATTGGTCTT 
17 76 01 AGTAAATGCT ATTTACTTCA AAGGAAACTG GGAAAAGAGG TTTCTGGAAA 
177651 AAGACACATC CGAGATGCCC TTCAGATTAA GCAAGGTAAA TTCCTTCAAA 
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177701 ATGTCTATTA TGGCAGAGCA AGAATCCTCT AAATATTTCA CCTGCATTTC 

177 751 ACATCCCAGT ACAACACTAC TTACAGCAGT AG CAGATGGT ATAAACTCTG 
1778 01 AGAACAGCAA CAGTGAAAAT AAATCAGCAG TCTCATTTAT AC AGATG CAT 
177851 GAGATTAGGA TTTTCAGTTA AGTTAGTAGC TTCTTGGCAC CAAAACAGTT 
177901 GAAAACACCA TGGTTAAGCA GCTTAAGGAC AAGAGAAAGT TTCTCTAAGT 
177951 ACTGAGATAT CATTTTCAGA AGGAATTGAG CTAATTCTGA GAGCAGTACT 

178 0 01 TCGACACCTA GGTCTCTTTT CATGCTTTTC AGACAGAGGC TGTATAATGT 
178 051 GAGCTCAAGT AGCCTAAGTG TTCTTTCCTA ATGCCCTGGC CATTGCGTAA 
178101 AACCTCACGT GGAATTCTCA AGAGGGTTTG TCATTTTAGC CAGATGCGTA 
178151 TGGATGATGT GTTCAGCATG CATTGTGGGC ACGACTGAGC TTACAGTATC 

1782 01 TCAGTGATTG TGCATGGACA ATTTACAGTA GCTGACAGCA TGCATACTTT 
178251 CGGCTTGTGT CAAAGGTGAG CAAAAAGAAT TTTCATTCAG AACACGTTGT 
178301 TTGACATGAG ATTACGAGTG CAAACACCTT TTGTATGTCT GGTGATGTGA 

1783 51 AGCAATTGTG TCGATACTGT GGCTGTGTTA TCTGAAACCT ACTACATTGC 
178401 ATGCGCAGTT TTAGGACCTG TAATAGTACA CGGTGCACAG AAAGGGTTTC 
178451 ATTCACAGAG TGGCTGATAG CAAAGCCTGC AAACAGATAA GCTTTTGCAC 

1785 01 TTGTGTAACA ATGGAAAAGA GAGAGTGGAT ATATCAGTGA AGGTCTCTGA 
178551 GCATAATACA GCGTAAGAGT TCAGATGATT ACTGTCTAAC GCGATTTCAG 

1786 01 TTGGTAATCC AAACCTCTAC AGTTTGGGAA AGAGAAAAAA CAAGCAGAGG 
178 651 T C ACAG C AAA T ATGGT C CAT AGGTAAATTC AATCAATCAG TGCTGTCCGG 

1787 01 AAGCATACAA AAGAGTTGAT GACATCCAGA GAATGAAAGT CAGCATTTTT 

178 751 TTCCCCTCCC AATCAACACA TTCACTCAAG AAATGTAAAG TTTTGGGGAA 

1788 01 AACTTGAAAC ATACAAAGTA GTTTCTTGTT TACCAAAGCT AACTCTTTCA 

1788 51 AAAGAGTGAG AAATACATTG CATGTAATTA TGTTATCAGG TGGTGTCTGT 

1789 01 GCTTTTTTTT TCTTTTTTTT CTTTTCTTCT GAAGATCCCT TTGACTTTGA 
178951 AACAGGAGAA ATGGCACTGG GAAAGAATAA TGCCAAGTCT TATACTTGTT 

179 0 01 TACAATTTTT TTTTGCCTTC AGTTCAACAA AGCAAGTAAT CTTTACCATT 

1790 51 CACCTTAAGG AATAAGTACA ACTAATCTTT TTCTTTTCTG TTCTTTTTTA 

Zl gene exon 5 

179101 ACAT CTGAAT CATTTCACAG ACCAAGACTA AAGCAGTACA GATGATGTTT 
179151 CTCAGAGATA CATTTTTGAT GCTCCATGAA CAAACAATGA AATTCAAAAT 
1792 01 TATTGAGCTG CCGTACGCGG AAAATGAACT CAGCATGTTC GTACTCCTAC 

1792 51 CAGATGACAT CAGTGATAAC ACTACTGGTC TGGAGCTGGT AAAACTGACA 

1793 01 TACTGCATCA CACGCACTAC AAAG CACTAA CAGAAATAGA TGAAAACAGT 
1793 51 GAGGAAGAAT GAACTTCAAA TGACACAATG ACTGCTCAGC CTAGGTTTCA 
179401 GGGCATCTAT TAATGATGCA AAAATACAAA TCTACCTGAG GAT AC AC CT A 
179451 AAAAAGTATG CCCACTCTAC TCTCTTAGCC TATTCGGTGC CTCCTTTTCT 

1795 01 ACCTCCAGAA TAGCAGAATA ACGAAAGCAA GAATCAAATC TAAACCACTG 
179551 TGCCCCAGAA TTAAT CTTCT GAGGGCAACA CTAACCAGTT TTATGT CAT C 

1796 01 CGCAGTCCAG ATTTCCACCT GATACTTTGT AACGAGGCTT TTCAAACTCG 
179651 GGGCTGACTT ACCTTGACCC ATGAGGTATC AGCAGCCACT CATGACCGTG 

1797 01 CCAGGATTAG TTC CTGAAT C TAAAT ACAT C AGAGCTTCAG AATCTAAATA 
179751 CAT CAGGGCA AATATCTTTT TATTTGCTCT TGAGGTCCCA TGCATTCCAC 
179 8 01 TTACTTACCA CTACTAAGAG AAATGCCTTA CAAATTCACA CATACCAAGC 

179 8 51 ACTTATTAAT GTGGTTAAGT TGGACACTGC ATAAAAGCAA CACTTCTCAT 
1799 01 ATCCACCTCC AAAATAATGA ATTATTCTGA AGGTTCACTC TACACCTCAC 
179951 TGCATTTAAG GAAACAGATA GAAGTACAGG TCACTCAGCA CTATGCAGGA 

180 0 01 TCACATCCTA AGAATATGCA GCACATTTCA GCTGTACTCA CAGCTGGTAG 
180 051 TTGGAC CTTT TAAATCTAGA GCATTAGACA CCAATGTATG CATGCCTTCT 
180101 TTTTTCTGTT GCATTATGAC TATATTCTTA TAAAATT CAT TGCAGGTAGA 

Zl gene exon 6 

180151 AAGAGAGCTG ACCCACGAAA AATTAGCTGA ATGGTCCAAC TCAGCCCGTA 
180201 TGATGAAAGT CGAAGTGGAA CTGTACCTGC CCAAGTTGAA GATTGAAGAA 
180251 AATTATGATC TTACATCCAC TTTGAGCAAC ATGGGGATAC AAAATGCTTT 
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18 03 01 TGACCCTGTT CAGGCTGATT 
18 0351 TCCTATCAAA AGTTATTCAC 
18 0401 ACCGAGGCAG CAGCTGCCAC 
18 0451 TAGAGTAACT TTCAAAGCCG 
18 05 01 ACAAATCCAA AACCATCCTC 
18 0551 GAGTCACTCC CTGCTCTACA 
18 06 01 GGCAGAGCTT GATACTCCTG 
18 0651 TTCAGACTAC ACATTCCACA 
18 0701 AGCCATAGCA GTGCCTGTAG 
18 0751 AGATACAAAC AC ATGG C ACT 
180801 TCAGGTGTGC ATCTTCACCC 
18 08 51 TCTGTAGCTC TTGACTAGTT 
18 0901 GCTTAAGTCT TTACCAGTTC 
18 0951 TAACAAGCAC ACACTCCCAC 
1810 01 TGTGACAGAA GAGCCTGACC 
181051 TGCTTCAAAA AT AG C AGTAC 
181101 GTGCTTTCTG TCAGCTCTAC 
181151 CAGCGACAAG AGGCTCGCGG 

1812 01 CTCCCACTGC TGCAGCGAGT 
181251 AGCCCTCTCC CCCACGATGG 

1813 01 AGGAGGGCGG GGCAGAGGCC 

1813 51 GTGGAGGGTC TGTGTTGTTG 

1814 01 GAGGGGAGAC GGTAGCGTTT 
181451 CGGGCGCGGC CGCCGGGCGT 

1815 01 GCAGAGTTCC GCACCGCTGG 
181551 GTGGGTTTTG TGGCAAACAG 

1816 01 GCCTGTTCTG AGTCACAGGG 
181651 CCTTATTGGA GCTGTAAGGT 
181701 AGAATGGAAG GACTGTTTCA 
181751 TGGCTATGGG CAGAAGGAAA 
1818 01 CCAAGCTCCT ACCAAGAATT 
181851 TGCTGCTACC GCTTGTTTGA 
181901 TGGCTCTGTA TTTTCTAGCA 
181951 AAAAAAAACA AACCACCTTT 
182 0 01 GGCACTTATC TGTAACTTAC 
182 051 GTAGGTATGT AGTATAACTT 
182101 TATGTGAAGT CTGACAAGCA 
182151 ATAAACCGTT TACTTTATTG 

1822 01 GCAAACTGCA GTGGACTTTG 
182 251 TCTGTAATGC TGACCAGATA 

1823 01 TTCAGTTGGA TTCTGGCAGA 

1823 51 TAATCTTGAA GGGAATACTG 

1824 01 ATAATTCTGC CATTCTGATA 

1824 51 ATAACTGTCT CTGTGGAGCC 

1825 01 TGCTCATCTC AAAAAATATA 
182551 ACTTTTCCTC ACTACAAGAA 

1826 01 AAGGGCAAGA AAGCCGTGAA 
182651 GGCTGAGGGG GCTGGGATTG 
182 701 GACTTTATCA CTCTGTACAG 
182 751 TTTCCTGGGT AACAGCAATA 
1828 01 AAAGGAGGTT CAGATGGGAT 
182 8 51 GGCACTGGCA CAGACTGCCC 
182901 CATTTAAGAA ACATGTAAAT 
182 951 TGGTGGGAAT GGGTTGACAG 
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TCACAAGGAT GTCAGCAAAG AAGGACTTCT 
AAAGCTTTTG TGGAGGTCAA TGAAGAAGGT 
AGGTGTCCTG GTGTTGAGGT CAAGAACACC 
ACCACCCTTT TCTCTTCTTC ATCAGACACA 
TTCTTTGGCA GACTATGCTC AC CTTAGTCA 
GAGCAGGAGA TGCTGGCTTG CCAGCTCAAG 
CTGCAGCTGA GGGACTAAGA CCTGCACTCT 
GCCCAAGGCA AAGCTTCAAC TACTCCAGAT 
ATGCATTTGA TTCCTTCCTC TTGCAG CAGT 
ATCTTCGTTC TCACAAGTAG AGCACCTGAT 
TTCCACCCTG C CATAATT AG CCCCTGCTCC 
CTTTTTGTTA CAGAGGCACA CACAGCCCAA 
ACTTCCATTC TACTGATTGC CTGAAAGACA 
GTGGGCTATT TCCTCGCACG GAGTTACAGG 
CATGCTGCTG ACTTTATACA AAGCAGCACC 
TGATAATAAA CAACCCCTCG TAGCTTGATG 
CAGGAGGGGA AGGCAGAAGG GGAAATCAAG 
AGGTAGCGAC CTCCGAGCTA AAATGGCCGC 
GCTCAGGGCC GCTTTCCGCA GCTGAGCTCC 
GCGGCCCGTG GCTAGGCAAA AACTTCCGGG 
AGGGGAAAGC TGGTGCTCGG CTGGGTGAGT 
TTTTCTGCGG GAAACACGCA TTGGTTTTTT 
CCCTCGCGGC GGCGCTCTGA GCGGTTTCGG 
TGACCGGGTG CTGGAGGCGG GAGGGGCCCC 
AATCCATCCC TGTCATCCAG CCCTGCCTCT 
GCGGAAATCG ATGGAGAGGT GCGAGCTTCA 
AGAGAGCTTG GCCAATTGTC CTGCGCCCAG 
GCACGGGATT AAATCGCTCC TGCTTCAGGC 
GTCCAAGTTT TCTTTTCATC AGTGTTTTTA 
CATGAGTACA GCTGCAGCTG TTGAACGTAG 
TGT CTTAGAG GAAACATGCC TGAGGAAACT 
GATGATGAAT CATTAATACA AAGTAGGCGT 
ACGTACCAAC ACCAGGCACT GCCTTAGGGG 
ACTACTAGTT GATATCCTGC GATGTCTGCT 
TCCACGTTCT GGCACTCGTT GCTCCTTCCT 
CGGATTAGTT AGCTACCTGC TCGGCTGACG 
CTGAGCTACG TATGTGC CAT GAAGTTCCCA 
CGTCTGTTTC CAT CGTGTAG ACAATAAAAG 
ATTTTGTACC ACAGCAGGAA AC C CCAGTAA 
AATTTCGTTT GAATATTGTA GATCGAGTCA 
CTGACTGCTA GGTCTAGAAC ACAAGTGAAG 
AAGACACACA GACTTTGAGA AGGTGAGTTT 
CCTTTCTGCT TTGGTTTTCC TGTAAAGCAA 
AAAGGAGACT TATTCTACCA AGTC CTAGTA 
GTATTATTTA CTCCATGAAG AAGACCAATG 
AGACATTGAG GTCTTGGAGT GTTTCCAGAG 
GGGTCTGGAG CACAAGTCTT ACTATTGAGT 
TACAGCCTGA AGAAGAGGAG GCTCAGAGGA 
TGACCTGAAA GGAGGTTGTA GTTGGCCTCT 
GGATGAGAGG GGATGGTCTC AAGTTGTGTC 
ATTATGAACA ATTTATTTTC CGAGTGGTGA 
AGGAAGGTGG TGGAGTCACG TCTCTGGAGG 
ATGGCACTGA GGGATGTGGG TTAGTGGACA 
TTGTACTAGA TGAT CTTATA TTTGCTTTAT 
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183 0 01 GGTTTATATT GAGAAATGTA AAAGACAGAA ATAGGTTGTC AGTTTGTGAT 
183 051 CAAATAAATT TAAGCCAATC TTCATTTTTT TTTTTCTCCT AGGCTTTGAA 
183101 CCATGGATAG CCTCAGTGCA GCAAATTCCA CTTTTGCTCT TGACCTTTTA 
183151 AATGAGCTGC GTGAGAAAAG CAGCACAAAG AATCTATTCT TTTCTCCTTT 

Z2 gene exon 1 

1832 01 TAGTATTTCT TCTGCTTTGT CTATGATTTT ACTGGGTTCA AAAGGGGACA 

1832 51 CTGAAGCCCA GATAGCAAAG GTATGTATCC AAACGTAATG TATTGGATTT 

1833 01 GATGCATATA TCATCTACTT AATGATATAT GAACTACAGA TCTGAGATCT 
183 3 51 GTATTACAGT CTGTGACCTC TAATTGCTGA ATTGTTACAG TCATTCTGGC 

1834 01 CTCAGAGGTC AGAAGTCTTC CTTAGGTATG TACATAAG C A GAACCTATTT 
1834 51 CTATTGAGTT TATGTATAGG ACTTACTGCA GTGTGAAATT AAGAGATTCC 
183 5 01 TGTTTTTTGG GGTGTGTGTG GGTTTTTGTT TGTGATACGG AGATCTTCCT 
183 551 TTTATATGTC ATTAACAGGC ACCTGGAATT TCTTTTTTTT TTTACTTACA 
183 6 01 TATTTGTATA TTTAGAGCTA TAGATGAATC TCCAGTTACA TAAAATAATT 
183 651 TACTCTGTAA TCTTTTTGGG CTTAATATCA GACTTTGCAT ACTTCAAAAA 
183 7 01 TGTAGCCAGA TAATCAAGGG AAAAAAAATC CAACATACAA GCATGTCATG 
183 7 51 TTAAACAGTC CCAGATTTTA GGAAACAAAC AAAAAAATGA TCAGTTGCTT 
183 8 01 GTTCAGTGTA ATAGCTTTTG TTTTCACAAC CTGTAATCTC AATCCTGGAA 
183 8 51 CATCCAGAAG AAAGAAGTGA TACAGGGCTA AGAACATAGC TCTGAAGTTC 

183 901 CAGAGAATAC CCCAGCAAAG ATTCAATGGG GCAAAGCTGC GTGGCCAGTG 
183951 AAGAGTAAAA TTCATAATGT AAACTTGCAA TTAAATTACC AGGAGAGCAG 

184 0 01 TTAAGGAGTG CAGTGGTGGG CCTGTTGTGT GACAGTAGGG TCAAATCTAT 
184 051 CATTAACTGC AGTGCAGTTT ATTCTACGTT CACTAAGGTG CGTGCCTGCC 
184101 TCTCTCTTTC TGGTATTGTA ATTTGGAGTA GAT C ATCAAT ACTTTTTCAT 
184151 TTGTAG CT AT GGTAGTAGTG ATGAGGCTGA ATGAGGATGA AGCTGATGTG 
1842 01 TTGTTTTAAT GGGAATTTAA ATATTTGCTT GTGTTGACAT CGGCTCCAGC 

1842 51 AGCCTATTTC CTGTTATCGC TTGAAGGATC GGGTTTGCAT CTAAGGTATT 

1843 01 AAATAAGATG CTTTGGTGCT ATT ATAAT C A GTGTGAAAAA TTATGGAAAG 

1843 51 TTGTTTTTTT TTATTTAATC TTCAGGCTCC TTTGTTTCTG GATTTTAACA 
184401 GTTTTGCTAG GTTTTATAGG GTGGAGATTA TAAATCCTCA GTTCTCTAAG 

1844 51 AAGTAC TGTG TACAGCATTA AGAAAAGGGC AGAATGTGTC TGCACTCAGA 
184501 CTTCTTTGGA GGCTGGATGG GTTCCTTAGA AAGCAGGGAG ATAAACCAGG 
184551 TAACCTCCAT AGCTTCCTTC CAACCTCAAC CATTGTGTGA TCCTCTAATG 
1846 01 CTTGGACAAA ATGAAGATAA AT AC CACT CA CTTTTCAGCA ACGTAATTTC 
184651 TTGCTTATAC AA CATC TGTG TGGATACATT GTACGTGACT TGTGTAATGA 
184701 AAAATCTGCT GGCTTCAAGT CTCAAAACTC ATTTAAAAAC AGAACAATTG 
184 751 TGCTGATGCA AGTGTGTCAG AGATTACGTG GACTCCACAG AAGGTATTTG 

Z2 gene exon 2 

184 8 01 TCTCTCTGCA GGTGCTTTCT TTGAACAAAG CTGAGGATGC TCACAATGGG 

184 851 TATCAGTCGC TTCTCTCTGA AATTAACAAC CCTGACACCA AATACATCCT 
1849 01 CAGAACTGCT AACCGACTTT ATGGAGAAAA GACATTTGAG TTTCTCTCAG 

1849 51 TAAGTAAACA TTAAATTTGG GTGTTGTGAA GTATAATGTA CTTGCTAGCT 

185 0 01 ATTCCCCTTG AAGGTTAGAT AAAGGCTTTG GGTTTTACTC TCCAAATTTT 
185051 TCTAGGCTGA GACTTACAAC CTGAGAGTCT ATGCAAAAAG CAGGATGTGA 
185101 ACAGAATGGA GAAGCTACTT TTAGATTATA TGAATGCACA ACTGGTGCAA 
185151 GAC C ATGAAA AAAAACTAAA TCTTCTAGGT TTCTTGGTCC ACTTTTGGTG 
1852 01 GGTTCTAGGA TCAAATGAAT GACAAATCTC CTTGCCTTTG ATAACCTGTA 
1852 51 GCTATGATGA AAACAACTGT TACTGCTGTC CAG CATGGGC AGAACTTTTC 
18 5301 TTTTTTCTTA ATTAAACAAT CCAGAGAACA TGCTGAGAGG AGTATGTGAC 
185351 TCTTAATATT TTCCTTATAA GTATATATAC ACAAGAGGGC AC AGGT A CGT 
185401 TGCATATACA TTACATATAC ATTATAACAT TGTATGTTCT CTCACTCAAG 
185451 CAAAAAGAAC AAACGGAAGA AACAAAAAGA AACAACCCAG ACAATCATTT 
1855 01 CTCAGTTGAG TACTGTAGAA TGTTCTGGTG TATTAAAGAA GACATTTGAC 
185551 TTCTTAATAA CAAAGAGGAA GATAATTCCT AGCTCAGATG GCTAATAAAA 
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1856 01 CAACTGATAA GAACATGTCA GACAAAACCT GAATGGCTTT AT AT C AAGCT 
185651 GGGGGAAGAG AGGATATAGA TTTTTCTCAG TGTACTTAAA AACATCTGTG 
18 57 01 GCTGAATGTC AGTAAAATGC ATTGCTAAAA AGCTGTTTTA AATGTTCATG 

Z2 gene ex on 3 

185751 GCAGTCATTT ATAGAATCGA GTCAGAAATT CTACCATGCT GGGCTAGAAC 
1858 01 AGACTGACTT CAAAAATQCT TCAGAGGATT CCAGAAAGCA AATAAATGGC 
185851 TGGGTGGAAG AGAAGACTGA AGGTGAGTGT TCTGCAGAAC TCCCTGCTGT 
185 9 01 ATGTAATGTC AGCCAGGACT TGCATAAACA GCTCTGTCAA GGTGTAATAC 

185 951 TGTCATTTTT AAAGCAAACA CAAACCTCAG CCATTGTGCT CTGTCTCTGG 

186 0 01 TTGGGGCATA ATTCCCATAT CTGATCTATC GTTAATACAT ATTAGATACT 
18 6 0 51 CTGTATTGCA ACAGTTGCTT ACGTACCACT GTTCAATTTG TGTTTTCTAA 

Z2 gene exon 4 

186101 AGGTAAAATT CAAAAATTGT TGGCAGAGGG AATTATTAAC TCAATGACCA 
186151 AACTTGTGTT GGTGAATGCC ATCTACTTCA AAGGCAACTG GGAAGAGAAG 

MAR- like element 
186201 TTTGACAAAG AGCGCACAAA AGAAATGCCA TTTAAAATTA ACAAGGTACG 
186251 CTACGTTAAT~" ATGCTGACAA TACAAAGGTC TTTGTAATAC AGAAGACAAA 
186301 AATTGTTCAA GCAGATTTAC CTAAGGTAGT CTGCATGGAG CTCCCTATGC 
186351 CCTGTCCCCT TAGTATGAAC ACTCTCTTTG TTTAGTTTCT GTTAAGTTTC 
186401 ACATAATTAC TAAAAA CTTT AATATCACAT ATTTATTTTA TACTCTCTCT 
186451 TTTTTTTCCT TTACTCTTTT TGTTTGTGTT TCAGTTGGTG AACTTGACTA 
186501 TGTCAGTGTA AAATCTGCAT GGGGAAAAAA CATTCATAGG TTCCAGGCAG 
186551 AAAAGAACTT CCGTGTGTGC AGAAATGTCT GAATATAGCA GTCATCTTCA 

1866 01 GTCAGAATGC TTTTCTTTCT GCTGTGTTTC TACCACTAAA TTGATAGAAA 
186651 TGAAATGAGG TGAAGAAAAA AAAAAC C ACT CTCCTTTGAA GGCCTCCATG 

1867 01 CTTGACTTTC TTTTGCTTCT AAAAGTG C AG CAGGGCAATC GAGGAGGACT 
18 6 751 TTATGTACTA TCATTAATAG GCTACGGCTG CCCCTTAGAG GTCAATTTCA 

1868 01 AACTCTGGAT GTCCACCCAG GTGTCGTGAG AGTGAACTGC TAATGTGAAT 
18 6 851 TGCTTAAGAA CTCACCTGCT TAAAATAACC ACAATGCAAA ATTGAAGCTC 
18 69 01 TAGTGCCTAA TTTCAAACTT CAGTGTTGAA ATATATACAG GAATGCTTGA 
18 6951 AACTGCTAAT ACCACTTTTC AAACAGGGAA TAATAATATT GCTCTTGCCA 
18 70 01 TACTGTATGC TATAGCACTT AGAAACCACT GCACTGACTT GGTTCCTGTT 
18 7051 AGGAAGGGAG GTTTTTTATC AGTTTCCCAC AGAGATGTCA CACAAAACCC 
18 7101 AAGCTTACAT TCTGCTTAGA GTTTTTTCCT CTCCCTCCTC AGGAGGCAAA 
18 7151 TCCAGTGCTG TTTCTCTGGG TACGAGGCTC AGCCTAGTTC TGAGATTAC C 
18 7201 CTTTCCTTTG CAGACACACA TTTATTTTTG AAGACTGCAG TTTTTGGGAT 
18 72 51 GCAGATGGCT ATTGGAACAA GTTGTAAGAT GTGAGACTGG GGAATGCTGC 
187301 CTTGGCTCAT CAAGTAACAC GCTGTTAGAT GTGCAACCAC AAACCTCTTC 
18 7351 CCTTACAAAA CTAAGTGGCT TAAATTTCTA TTTTCATCCT ATTGATGACT 
18 7401 AGT CACTGAT GAGCTACAGA AGT CAATGAG TAGGCTCAAA TAAGCAATGA 
18 7451 AAAATCGAAA GGGCAAAGCT GAAGTTTTAA GCTAGTTATT TTACAGTCTG 
18 75 01 TCCAGGAGTA GTTACTTAAA CAT AC C AGT A GTCTTCTGAG CATTCTGTGA 
18 7551 CATCTATTTT ATCTGTGACT TTTGCACTTT GTTGTGACTC ACTGGATGAT 
1876 01 TCTAGCATGC AGTGTGGGCT TTTCTTTGCT TATCCATCAT TTTCATGTGT 
187651 CACTGATTGC TGTTGCAAAA TCATTTCCGA CAT ATT CTGT TCTCAGAGTT 

187 7 01 CATGGCAGTC ATTTATAGAA TTGAGTCAGA AATTCACTGT CTCAATGGTC 
187751 TTTCCTTTAA AAAGAAAAAA CGGTGAAGGT AAGGGGAAGA AGGGATTTAG 
18 78 01 ACTCCACAGA AAAGGAGGAA AATAATGTAG ACAAAAGTAA CTGATGCTCC 
18 7851 ATGCAAAAAT GGAGAGAGAT GGGGGAGAAA CTGGTAGTAA GAAGACAAAA 
187 901 GATTAACCTC TCCATGTGCC TTTTAACAAT T C AAAGTG AT GCTAATACTT 
18 7951 TCCTAGCATT TTTAGTGGCT TAGTAAAATA TTTTTGTTGC CCTACGTCAG 
18 8 0 01 AGTAATTAGA GACACATGGA GTGAAATGAA AAT AT CGAAG TTGAAGTTAT 
18 8 051 TTTGTATTTA TTTAAAGCAA GGAATACAGG CTCTGCTTAT TACCAACTTT 
18 8101 GTTTAGAGCT TGTCACTACT TCTAAAGTGA GCAAATATGT ATTCTTGCTC 
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188151 CTTTACCCTA AAGCAAATTT CACAGATATC TCCAATTAAC AATTAAATCT 
18 82 01 CAGGGATCCT TACTTCTCAT CTCTTGCTTT ACGAAAGAGT GACTGTGCTA 
18 82 51 TACTATGTTA TGCAGTGTAC TTAGTTCTCT GTG CAGTCAA ATAGTAAAAA 
18 83 01 GCCCTAAGTA ACTAGATGCC TGCTTCATGT ATTAGGACTG TCATGCCAGC 

Z2 gene exon 5 

1883 51 CCAGTAGTAA CTCTTAGTGT CTCTTTCATT TTAGAATGAA ACCAAACCTG 
18 8401 TGCAGATGAT GTTCAGAAAA GGTAAATACA ACATGACCTA TATTGGAGAC 
188451 TTGGAGACCA AAATCCTTGA GATCCCTTAC ATTGGTAATG AACTCAGTAT 
188 5 01 GATCGTTCTA CTCCCTGATG CAATCCAGGA TGAATCTACT GGCTTGGAAA 

188551 AGGTAAGTTA TTGAGCTCAG TGCAAAGACA GTTTGTGTCC TGCCTTGGAA 
188 6 01 GAGAGTTTGG TGCTGCACAT GGATT CAC AG TTCAGTTTCA GAGCTATTAT 
18 8 6 51 AT CATTGATG CTCAAGACTG ACTGAAATGC TCCTTGTGTT TCTGCCCCTA 
188 7 01 AAGTGGCATG CCATCTATTA CTACTGGCCA AGCTATGTGC TGCTGTGCTA 
188 751 AGAGGCTCTG AAAGAGGCCT CAT C AGAAGC TGTAGTTATG GTGAAGC CAT 
188801 AGTATGATGA GCACCAAATG AGAGGGAATT TGGGGCAGCT CTTAGGAAGT 
188 851 CCTTACCAGA ATTT CTACAG TTTGTCCCAT AGGTCATCTT AGTGAAGACC 

188 9 01 TGG CAGATTG TCACTGCCCC TCTACTTGGA AACACGCTCA CAGAATAGTC 
18 8951 CAGGTTCCCT TCCGTTGTGA TGATAGAATA CAAGTCATGC TCTGGCCTCT 

189 0 01 TGTTTTTTTT TCTAATGCTG ATTTTAATTT AAAAAGTGTT GTAAGCAGGT 
189 051 TTTGTCACCA GCCCGTGAGC TGAAAGATCC TGAAAGGCTG AAGAACTGGG 
189101 TTCAGTTTGT TTGGGGCCTT GTCAGCAGTT CTCCCGTGCC TTTACTCCCT 
189151 ATATATAAAA TAAGGTTTTT ACAATCTGAT AATGTTTTAT AAACTGAACT 
189201 TTACTGTATC TACCACGAAA AAGAAAACAC CAAACAAGAA TTGACCTCAG 

1892 51 CTGAAGCTGT AGTCTCTAGT AAGTAGAAAC CTGTAGTGAC TTGTGCTTTT 

1893 01 GACTTGGGAT CCTGTAAGCT CCTGAAAAAG ATGCATATTG CATGTATGTG 
1893 51 TTTACATAAC ACACATACAC AGACAAAAGT AGAGATTAGT GCAAAACTGT 
189401 CACTATTCTT ATTTTAATTA CCTAATGTTG GGTTATGTTT CGTTGCTTTT 

Z2 gene exon 6 

189451 TTTGTTTTAA GCTGGAAAGA GAACTTACAT ACGAGAA.GCT GATGGATTGG 
189501 ATCAATCCTG AAATGATGGA CAGTACAGAA GTGAGGCTGT CTTTACCCAG 
189551 ATTTAAACTG GAAGAAAATT ATGATCTGAA ACCCATCCTG AGCAACATGG 
1896 01 GAATGCGTGA TGCGTTTGAC TTACGGATGG CGAACTTCTC AGGAATCTCC 
189651 TCTGGTAACG AGCTTGTGCT CTCTGAAGTG GTT CACAAGT CCTTCGTGGA 
189 7 01 GGTCAACGAA GAAGGCACTG AAGCAGCAGC TGCCACAGCA GGAGTGATGG 
189751 TGCTCCGTTG TGCTATGATC GTTCCCGACT TCACTGCCGA TCATCCCTTC 

189 8 01 CTCTTCTTCA TCCGGCACAA CAAAACTTCC AGTATTTTGT TCTGTGGCAG 
189851 ATATTGCTCT CCCTAAGAAG AGAGACAGAA GAGCTACCAT TAACGCAGTA 

18 9901 ATGTGATTTC TTTTAGGATA GAACTGCTCT TTTGCACTAA CTGCTTATTT 
189951 CCACTGTGCC TGAATCCCCT TATCTGGTTG TCATTTTGGG CTTGCGTAGA 

190 0 01 GTAACAAAGC CACTTACACA TACACAGCAG CTACCACTTG AAACAGCTGC 

19 0 051 CTTACACTTT GCACCTAAGT GGAGTTGTTT TCTTGCTGGC CCAAGAAAGA 
19 0101 TGAACATCCC ACTTGCTCAG TGAACTTCCA CCTGTCTTAT ATTTTCTATT 

MAR (0.658) 

190151 GCACTTTGCT TTTGTGTGGC CACCAGGTAG CAAGGTGACA AAGAGAAAAG 
1902 01 AAGTGGATTT TGTTTCTGAC TATAGTGGAA GATATCTTAT GCTCTGCTCC 
190251 CCATTTTTCT TCCTCTCCCC ACTTATTTTT AACTTTTTCT TTAATGTTTT 
19 03 01 GATAATAGAG GGAGATGAAA GGAGGCTTTG GCGACCTATT TGTAAGAGTT 
19 03 51 ACTAAGCATC TGCACTAGAC AGAGGTTTTA TTATAACTGG ATAGCACTTA 

1904 01 CACAAGGATG GGAATAAAAG TATGTCTGTA ACAAATGACC TTAGAGGTTT 
190451 TCATGGAGTA CGGA TTCTTA TCTTAACACC ACATGTGCCA CCTGGGAATA 

1905 01 TTAGCTATCA CTCACCTACT TCATTAGTCT TTTAAAAAAA GAATGTTTTT 
190551 AAAAACAAAC AAACAAAAAA AAACCCATAG ATGCCTATGT AGTATTTAAG 
190 6 01 TGACAGAGCT TTATTTTTGT TTTTCAGTCT TTATATGTTT TTTTCCTATT 
190651 CTGGGTTTGT AAAGCA TCTT TGTTAATGTG AATGCCAAAG GTTTCTTAAC 
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190701 GCAGTGATTT ACGTGTTTTG 
190751 TGAGTGCTGT GGGCATTGCC 
19 0 8 01 GGCTACTGTA AAGAAAACAA 
190851 TTCATTGCTT AACAGAGGCT 
190901 CCTGGATTAC GATGGAGTTT 
190 951 GGAGTCTTTG AGGCAAAGGC 
191001 ACTCCCTGCT GTGGGCATGG 
191051 GACACCTCCA AGGATGGGGC 
191101 TGCCTCACCA ACCTCCAAGT 
191151 GTTCCTTTCA TTTATTTTAA 
1912 01 AC C AAGGCT C CCTTTATGTA 

1912 51 TCTTCTCGAG GCTGAACAAG 

1913 01 TGCTGTGGGT GCAACTCCTT 

1913 51 AGATGCAGCG ATTAGTCACC 
191401 CCTGTCAGAC CGATGCT CAG 

1914 51 GAAGTGGGCA GCCTCTGCTC 
191501 CAAGATGAAA AGTAGAAATT 
191551 TGCTGGAGTG TATTCATGGT 
191601 TTGTCTAATT TGCTTCTTTT 
191651 CGTCTACCAT AAAG CAG AG A 
191701 AAGC AG C CAT AT CT AC AAAT 
191751 CTGCACAAGC AGGAAGGCAG 
191801 CTTTGCGTAA CCATCTGCCC 
191851 CAGAGACCTC ACAGCTCCCA 
191901 CTCTCCTCCT GGTTTGATCC 
191951 TAAGTGTGTG CTTGCTGTGC 
192 001 GCCCTTGGAA C T CGC AC TAG 

192051 CTGAAACTTT TATTTTTCAG 
192101 ACAGCAGGTT TGCACTTGAT 
192151 TCAGGAAACA TTTTCTTCTC 

1922 01 GGTCCTCCTC GGGTCCAGAG 
192251 GCAGCATTTT CGCTTGTTTT 

1923 01 ACAAGGCAAG GGGAGGAGGG 

1923 51 CTGTCACTCC TGCTGACTCT 

1924 01 CCAGAGCCCT CCTCTTGTGC 
192451 TGGGTTAATT GATGAATCTG 
192 501 ATTTAAAATC GATTAGCATC 
192 551 ATAGGACATG GGGGAACAGC 
192 601 TATTTGTTAG AGGAAGTTTT 
192 651 AGCTGCCCAG GGAAGCTGAG 
192 701 AGGTTGGATG GGGCCCTGGG 
192 751 CACAGCAAGG CAGTGGAACT 
192801 CCCCATTCTA TGATT CTATG 

192 851 AATGTATACA TACATAATGG 
192901 AATAAAAAAT AAAATATAGA 
192951 CAATACATGG TGATGTTACC 

193 001 CAGTATTTCT GAGGAGCAGA 
193 051 GGTTGAGCAG CAGAGGTTAA 
193101 CTTTGTTGTG GACCCAGCTC 
193151 GTTGCAAACT GCAAATTCTC 
1932 01 ACTCATGTTC AAAAGGG AC C 
193 251 CCTGCTAGTG CAAGGGGAAG 
193301 AACTTATTTG CCCTATGATT 
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CTGTTCTTGA AAGAATAAAC AAATTTGTTG 
CATAAATTTT GTGGGGTTTT TTTTTTTCAT 
GCAATCAACT TTCGTGTAGC TTATGCAGAA 
TTTCTGAATG CTGCAAGACC AAGATGCTTA 
AGGTTTTTAC CTTCGAAGGA TTCATAGCAA 
TCAAGGGATT TTAAAGACTA TCTGGTTCCA 
CTGCCCAGGG CTCCCTCCTC CCTGGCCTTC 
ACCCACAGCT TCCCTGGGCA GCTGTGCCAG 
G AAAAAG AT C CCCCTGACAT CTAATTGAAA 
AGCCATTCCC TCTTATTCCA TCACTATCAG 
TTGGAAGGCC ACAACAAGGT CCCCTTGGAG 
CTAAGTGCTT GTTTCCAGAT GGGGTACTGC 
GGCCCCAGGC CAAGGAGTGT GCCATGCCTC 
ATTTGGGGTG AGGAAGCTGC CAAAGTGCTG 
TCAGGGCTGA GAGCAGCAGT GGGTAGAAGG 
CCAGTGCATT GTCTGGGAAG GGGGTGGTAG 
TTTCTGACCC TTCCTACGTG TCCAGGCTGC 
GCTATGCTTA AAGTGAAAGC AAAAGCGTGC 
CTAAATTGAA AAGGAAAGTA AT C ACATTAA 
GAAGCTGCCA GAAAGCTTGA GAGAAGCTAG 
CCCAGTGCAA ACAAGAAGGA GGGAT CCCAG 
GAAGGTTTTA CAGCACTGTC TGCCGCCAGC 
GCCCCAGCAT TGCACCTTTC AACCCACTCC 
GTGGTCCTAG CTCCAGCTTA CTGCTGGCTG 
TCCCTAGCAG CTGCCAAGCA TCACAGGAGG 
CTCTGCATTT TGCAGCCTGA AATGAATCCA 
GGCAT CGAGG AGTGCTTTCT GAAGCCTTCA 

Z3 gene exon 1 
CTGCAGCCAT GGAGAGCCTG AGCAATGCCA 
CTCTTCCGAA AGGTTAATGA GACGAACCCA 
CCCTCTCAGT ATTTCTACTG CTCTGGCCAT 
GTAATACAGA GACACAGGTG CTGAAGGTCA 
ATTAAAATTA AATGTTGTTC AGTTTTAGAG 
CGTTATTTGC GTGAGCTTGG GGCAAGGTTC 
TCCCCCCTGC TGCCACCTGC CTGCTGCACT 
TCACTGATAG CCCTTCTTTC TCACTTCATT 
GAAACTAATT TCACTGATTT AT CAGT CTTA 
TCCAGCAGCA AGTCTTTACT AGAGCTTGTG 
ATTAAACAAA AAGGGAAGAT TTAGGCAAGA 
TCCACTGAGA AAGTGGTGAG GTGGTGGCAC 
GGTG C C C CAT CCCTGGAGGT GTTCAAGACC 
CAGCCTGAGC TGGTGGGTGG CTGCCCTGCC 
GGGTGGGCTT TAAGTTGAAT TCCAGCCCAA 
AGCCTTTTCC ACAGAGAACT ATTGTTTTGC 
TATACATAGT AATGCTAAGT GTATCTTATA 
GCTGTATTAT TCTAAGGCTG ACAACTGTTA 
CAAGACCCAG TGTTATAGCA GCCAAGCACC 
ACTCACGTGT CCATTCTCAT GGTAT C CTTA 
ATGAAAATGG TGTGGCTCCT TTACTGGGGG 
ATCAATCCTT TCCCACTCTC CACAACAGCA 
ATGTAGGTAG CAGTGCCAAT TCCCTCTCAG 
CTGCCTCTTT TTTAATTTGC AAGGCAAACA 
TATGAAAGAA TTGTCGCTGT AGTTCCTATT 
AAGTTCACCT TTGTATTCCG AACTTTAGGA 
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1933 51 AGAACTTGTT TAGACCATTA 

1934 01 AAACTGAATC ACTGCTTGTA 
193451 ACT CAACTGT CAGCCCCACC 

1935 01 GCAAGAGAAG CATAGGGAAG 
193551 GACTGGGTGA TCCTGTGTGT 
193 601 AAGGTGTTTC AGCACAGTAA 
193 651 TGTTTAATTG ATGAGAACAT 
193 701 TGTGAAGGGT GTTTATAAGC 
193 751 CTACCTTAAG TTGGAATCAT 
193 8 01 GGATCCAGCT CCACACAGCA 
193 851 CCTGCACACC TTGGCTAGTC 

193 9 01 TTCATATTAT AACAGCTTTT 
193951 ACTTATATTT TCATAAAACC 

194 0 01 CCTCACCTTC CCTGAGTGCT 
194 0 51 ACATG CAGAC AGCACCCTAT 
194101 GCTGTGGCTA ACTGTATTCA 
194151 CAGCACTAAG GGAC AGTTT C 
1942 01 TTCTTTCTGC TTTTGTTTTT 

1942 51 ATATGCTGCA TGTTCTCTAT 

1943 01 TCTTTCATTT C AG A CG TTTC 

1943 51 GATTCCGGGC TCTGACTGCA 

1944 01 CTACGGATTG CCAACCGGCT 

1944 51 GGTATGGGTA C ACAGAC CAT 

1945 01 TGTAACTTCA TCATCTGTTG 
194 551 AAAAAT AT CA CACCAGCAAG 

1946 01 ATAAG CAGCA TAGCTCAGGT 
194 651 GAGTATTTGA GGTAAAAACA 
194 701 CAGTACGAGT GTTGTACACA 
194 751 TGGACATCCT AAAGGATACG 
194 8 01 CTTTGATAGT TCAGTTCTGC 
194 851 ATGTAGGTTC TTAG CTTAGA 
194901 CTGTTGTTTT TTAGGCTTTT 

194 951 TTTTTGAAAA CATTTCTCTG 

195 0 01 TGCCTCTTTA CAAGGGAACT 
195 051 TTTTATCCCT CTCCAGCTCT 
195101 TT 
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ACTGCTGCCA TTCTTTGTGA AAAGACTATA 
GAAACAGACT TTGAACATAC ATTCCTTATA 
CAGGAAGAAT CTACTGAGAG CAGAAATAAT 
TTGGAGATAG AAGGTTGGGA TGAATGGTTG 
CTTTTCCAAC TTCAGTGATT CTATGATTCT 
CCTTCTGTAA TGCACATTCC CATGGTATAA 
CAGTTAATTA AGGAGATGAT GACTGATGAG 
ATGCAGAAAT CCATTTCTGG GATCATAATC 
AGAGTACACC ACGGTGGAGG GGATCCATGA 
CCACCCACTA TGGTAGATCC TGCTGCCCAA 
AGCTTCCCTT CAGGTATCTG TATGCACGCT 
AATTTTAAGG TGATAGTTGT CTGTAGAAGC 
AAAGGTTATA GCTCTCACAT TTTCCTAACA 
CAGACAAGCT CAGTAGTCCA CGGAGGAAAA 
TAGGACTCTG GAT CACAATT AACAGCTTCA 
GCTACTGCTT TACAAGTGAC ATGGCTGGCA 
ACTTGTT CTT TGATGGTTAC AGCTTTCAGC 
CAACTTAACT ACCAAACAAA TACCATACAG 
AAATACAGCA TTAGCAGTAG TTAGCTCATC 

Z3 gene exon 2 
ATTTTGATGA AGTTGAAAAT ATACACTCAA 
GACAT CAACA GAAGGGATTC TTCCTGTCTC 
TTATGGAGAG AAGTCCTACA GCTTTCTGCC 
AGCTGTGTGG TGGAACCTGG GGGGAGGCTT 
CTCTCCTGCC TCCAGAACGC GCCCCATAGC 
TCCAGATGTC AAAACTATCT TTCTGCATCA 
GTTGCTGTCT TTATAGGAAT GCAGCCATTT 
TGACTAGACA TCTAAAAGTT ACCAGGCAGT 
TGCCTATAGA TGCAGAAATG CATATGCATC 
CCTAGAGGAT ATTACATAAC AAATCCCTTT 
TGCTTTGGGG CTCAAGAGAA ATTGCAAGCC 
GTACAGATTA GCAATGCCCC ATTCCTCTGT 
CATTGCTCTA GTACTATATT ACTTAAAACA 
GGGGGAGAT T GCCATCATGT CTCAACAGCA 
GTACCTCTGC AT CTATTTAG GTACTGCTAT 
TTCTGGGAGT TTTTGTTTTC TTAGT CAAGC 



Fig. 1 
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SEQ ID NO: 2 

AAAGTCTAGAGTCGGGGCGGCCGGCCGCTTCGAGCAGACATGATAAGATACATTGATGAG 60 

TTTGGACAAACCACAACTAGAATGCAGTGAAAAAAATGCTTTATTTGTGAAATTTGTGAT 12 0 

GCTATTGCTTTATTTGTAACCATTATAAGCTGCAATAAACAAGTTAACAACAACAATTGC 18 0 

ATTCATTTTATGTTTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTAAAGCAAGTAAAAC 24 0 
C T C TA C AAATGTGGTAAAAT CGAT AAGGAT C CGT CGAGCGG C CG C 285 



FIG. 4 



SEQ ID NO: 3 

TGCGATCTGCCTCAGACCCACAGCCTGGGCAGCAGGAGGACCCTGATGCTGCTGGCTCAG 60 

ATGAGGAGAATCAGCCTGTTTAGCTGCCTGAAGGATAGGCACGATTTTGGCTTTCCTCAA 12 0 

GAGGAGTTTGGCAACCAGTTTCAGAAGGCTGAGACCATCCCTGTGCTGCACGAGATGATC 18 0 

CAGCAGATCTTTAACCTGTTTAGCACCAAGGATAGCAGCGCTGCTTGGGATGAGACCCTG 24 0 

CTGGATAAGTTTTACACCGAGCTGTACCAGCAGCTGAACGATCTGGAGGCTTGCGTGATC 3 00 

CAGGGCGTGGGCGTGACCGAGACCCCTCTGATGAAGGAGGATAGCATCCTGGCTGTGAGG 3 6 0 

AAGTACTTTCAGAGGATCACCCTGTACCTGAAGGAGAAGAAGTACAGCCCCTGCGCTTGG 42 0 

GAAGT CGTGAGGGCTGAGAT CATGAGGAGCTTTAGC CTGAGCACCAAC CTGCAAGAGAGC 4 8 0 
TTGAGGTCTAAGGAGTAA 49 8 



Fig. 5 
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Chicken ovalbumin Locus. ST25 

SEQUENCE LISTING 

<110> university of Georgia Research Foundation, Inc 

<120> chicken Ovalbumin Locus 

<130> U022 1060.1 

<160> 3 

<170> patentln version 3.2 

<210> 1 

<211> 195102 

<212> DNA 

<213> Gall us gall us 

<400> 1 



tgcaagcttg 


caagttttac 


tgtctcatat 


ggttcacata 


ccattcattt 


agcgtggcct 


60 


cacaaccaac 


ttgggaacca 


ctgccctaag 


taacaatacc 


tcacatatcc 


ataggatctt 


120 


caacaccatt 


ctgcttaaaa 


ttcaaatccc 


tttcattaaa 


atagaagata 


tttagataca 


180 


agataaatag 


agcatatgtt 


tgtatgtggt 


catcttcagt 


tctagaaaca gtttcttctc 


240 


tcttatgcaa 


ctgaaagtag 


cttagtttcc 


acagccccgt 


gcagagcaat 


acggttgtac 


300 


tgctctcaaa 


gaagtaccaa 


cacaacatgc 


atcagcagag 


ttgcacaaga 


ctactgaaat 


360 


aaaatttgct 


tagctgctag 


tgaaaacata 


agtatccttg 


cttgtaaaca 


aaagcataag 


420 


tcatttcaca 


cacctccatt 


tgcagggctt 


cagctaatcc 


tcgaagagca 


aacttggtcg 


480 


gagaataggc 


tgtataacca 


aaaaggccta 


attgcccagc 


ctgagatgat 


acaaacacaa 


540 


tccttcccat 


ccgtcgttcc 


ttcatggtag 


agatgactgc 


acgactcggg 


taaacactac 


600 


ctagataatt 


gactgccatt 


aatctctgtg 


ggaaaaaaaa 


aagatattaa 


aatggtattt 


660 


cagtcactga ttcagaattc 


ttcagatgtt 


tagttttatt 


tctccaaaat 


ccctactgag 


720 


gcttgcttta 


acatcaaact 


ccactacatt 


tctatcagcg 


agcaaagtaa 


acaattgttt 


780 


tctactgtat 


ttgtttcata 


aacgttccat 


agcaaagaat 


ctcaccgtaa 


gacaaaaaga 


840 


aatgcactgt 


atatacaagt 


gttaatggca 


ttgaggatac 


taatgatgtg 


agctattatg 


900 


gctttcacaa 


agaaaaaagg 


gcaaattaca 


aaaaaaactg 


acttgaacag 


tgtctaggaa 


960 


gcaatccatt 


tatacaagaa 


gtacaaaaac 


attgaagtga 


aagtttaaag 


ggcttcctaa 


1020 


aaaaaagaaa 


gaggaaaaat 


tctatgtata 


tatgcacaag 


ctctcatctt 


ggcaggtaag 


1080 


attaattcct 


gtttcaatct 


cctgaagacc 


taaaacacaa 


ctggtggcct 


acacaagcaa 


1140 


taaaacagat 


tatttaccct 


tactgttaca 


cttactataa 


cattatgcct 


attcagtatc 


1200 


acgtgcattt 


tcagatggct 


ctgtgatgaa 


gtgagaatca 


aagtaaccag 


aaagtagaga 


1260 


ggctgtttgg 


ttttttttaa 


tgtattttgg 


tttgggtttt 


actgttgttc 


ctttctttcc 


1320 


accaagctgc 


accaagttac 


caacagttta 


cataccgcag 


agttgaatgc 


tgacaagatg 


1380 


gattcttctt 


taacagattc 


aaaatatggg 


aaaaagaccc 


acacatcttc 


accaggaaaa 


1440 


aagagcccaa 


ttcagtgtct 


ttcacagcgt 


gttaaataaa 


agcacacatc 


aaccttcctt 


1500 
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tccatcttga 


ggtccttccc 


tatcccaggt 


ggggaattca ctgttctgaa ttgtccatgg 


1560 


accaataatc 


aggtaaaaaa 


ataagttgca 


atccttgcaa 


atctaaaaat 


gcatacaaag 


1620 


accaccacta 


acgaaatata 


ttgctactca 


cacagtccgg 


aaagacacag 


taaaataatc 


1680 


tattccttag 


aacagcggaa 


tgacaaatta 


cttgcaatga 


aataaagcat 


gatcactcac 


1740 


ttcaaaagaa 


tttacttcaa 


tatcctcaaa ttttcctgta actgatgttc 


ctgcacagtt 


1800 


gacaagcagg 


tcaactggtc 


ccagcttctc 


ctgagcctgc 


aagaggacag 


aaattaagct 


1860 


ctccagcagg 


aaaaaaagca 


ttcatatagc 


atagtgtcat 


cattaaatat 


actgccactt 


1920 


gaaacacatc 


gtacagagat 


gaccttaata 


catacgctgg 


tggaacagtt 


gcttctcacc 


1980 


tgttttagaa 


cgttctccac 


ctgttcatag 


tctttagata 


catcaacaga 


aatacacagc 


2040 


acaacctaaa 


aatggaaaat 


gaaagaacat 


attataacct 


cagtccaaga 


atggttggcc 


2100 


ccaagtctta 


caaaagaaac 


tgcacactgc 


ttcatgaggt 


catattaggg 


acaaaattag 


2160 


aaaaaaacaa 


gactaaaaaa 


aactgattag 


tggcacagag 


ctaaatatgc 


ctaaaaagtg 


2220 


atttaacgtt 


tactctttgt 


ctgggaacaa 


gaggttatgg 


atcttagttt 


gaacgataca 


2280 


ggaaaatgaa 


aatagacact 


ggcccagggg 


agaataaata 


aataaatagt 


aaaaaaaaga 


2340 


gagacagcaa 


tgtcttaatt 


tttgttgcag 


ctggaacgta 


ggtattgatt 


ttctttaaaa 


2400 


ctgctgatta 


catttagcaa 


cttgatctca 


gaatctaagt 


tttgaagcac 


cttccagtca 


2460 


acttccagcc 


aaaggaagca 


ctgactttgg 


ggtatcacca 


actgtaagat 


cactgccaac 


2520 


tccagcttgg 


ccggggattt 


acaaagagat 


tatcccctct 


gctaaacaac 


tatcaaggtt 


2580 


ctgaggcaac 


tcagtatctt 


gaaaggagaa 


gcaatcacat 


accacaatag 


aagtagagac 


2640 


tcctgtattc 


tcattctgat 


ttctacgtct 


tactttgtcc 


aattccctta 


aagctggttt 


2700 


gaggtgaaat 


aaagtcatca 


ccggattcta 


acaagcgtca 


tcagggtcac 


ttgtcatgaa 


2760 


gcacgccaac 


aaaaatgaag 


aaaagcttac 


caaagtgtaa 


ctaactgctt 


tggcagctag 


2820 


tcatttgtca 


gttttgcctc 


tggtcagact 


gcaacacttc 


ccagctacat 


aaacagaatc 


2880 


tgcttgccat 


tcctgtgcag 


caggttggct 


attcctgcct 


tccagcactg 


tgcatagctg 


2940 


gacatcggag 


taaacttgtg 


gtgcaagtta 


atggttggcc 


agcaagcatc 


tccccttagg 


3000 


gcatttttag 


attttagaaa tatttatctt 


ttacataagt 


ttgagaacaa 


aaataacagc 


3060 


tgagctaaga 


taacactgat 


ttactatctg 


gatgtcttta 


caagatcagc 


aaggttaaaa 


3120 


atcacagttc 


catacgaaaa 


ttggacatcg 


gaccacagat 


cagttaatca 


tactgagaac 


3180 


aatatactca 


ggaactacta 


aagcttgtat 


tggctaccag 


cacattatac 


aattcatctt 


3240 


tttgcttcta 


ttattgtatt 


tcctttctgc 


tacattaagt 


tgattacctc 


cagtccagag 


3300 


catgcactgt 


gaatgtggtc 


ctaataaaca 


gactatgctg 


ccaggaagtc 


taatatcctc 


3360 


attccagttt 


ctcgtctttt 


gttctaatag 


cgcattatct 


gaccattcct 


aaagcattcg 


3420 


tttctcaata 


aaagctcaac 


tccactccca 


gtgaagtaac 


taggaatatt 


ccatactgag 


3480 


aaagtgattt 


taaccatttt 


ccaaaaaaat 


tatgggcaag 


catttcattt 


cactgcatcc 


3540 
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ctttatttgt 


aaaggctgca 


ctgtcagcat tctaaataaa 


tcacatacaa 


atgtatttca 


3600 


gagaacacgt 


gccaaatcca 


acacctaaag 


gtacgttgac 


accagaatcc 


tggttctcaa 


3660 


tgcaacttag 


tttgacatat 


atatacatga gcagaatgca 


cagcatataa tcatgtaaaa 


3720 


actggtattt 


cctctggatt 


tttgctccct 


agcttggcat 


cttgcaaatg 


taacagcctc 


3780 


tgtggtctgg 


caacccatcc 


cagacaactc 


tatactttca 


gcaatcttct 


tgtacagcct 


3840 


ctgtctcttt 


cgctgccctc 


atttgtcagc ttcttatctg 


ccgttcttct 


tctatttgtg 


3900 


ttctttgaat 


ttagaacagg 


cagacttccc tgtgacaatg 


actacaactc 


aacagtaggt 


3960 


acacgagggc 


actcccacta 


cagccagatg 


aagtatggac 


aagtctaaat 


ttgctactgc 


4020 


taaaacaatc 


ctgtttagct 


cacactgaag 


cttgttcagt 


cttgaactta 


ctgaaatttt 


4080 


aactgaaact 


cttgcagaca 


catacctagt 


actcttctta 


agaaaacaga 


aatgtaagtc 


4140 


cacatctgtc 


aagcttggac 


agatttttaa 


caagagcatt 


gtgacagtgt 


ttgacaaagg 


4200 


atcaaagtca 


ttgtcaaccc 


tgggaagaga 


ctagagcaac 


taaagggaaa 


cccttactgg 


4260 


ggtttatttg 


tttttaaaag 


gggacaaaac 


gttgctcctc 


ctcacctcac 


atgacttcat 


4320 


atttcaagat 


aaatttgatc 


tgagtaaggc 


aacatgcaca 


gaaagaaaga 


tcaccttgaa 


4380 


ccatttgaca 


ggaatgagca 


aagacagaat 


cttgctgagt 


gttcttgata 


tcttcattta 


4440 


catcggctag 


tactaacatt 


gcttccaaat 


atccatcttt 


ctccactgct 


tcttttcagg 


4500 


agaatatcca 


acttatccat 


tcaccagatt 


attatacacc 


caatagacac 


caaatccctt 


4560 


atcacaaaca 


aattgcttgc 


acttgtaata 


aagaaaataa 


gaaccaacta 


ctggagcaca 


4620 


agagagagga 


aatcaaagat 


gcgatggtct 


caatgctgtt 


tcccgtctct 


tttggattat 


4680 


gagtattaag aaaaaaaagt tgtagtattt catcttgaag ctgtgttttt ttgaacacag 


4740 


aaagctgccc 


tattttaaaa 


catagcaccc 


cagcaagcat 


taatgcactc 


aaacagtgca 


4800 


ttaagcgtga 


atacattatt 


tatttaccaa 


agtacctcca gtgtttagaa 


acctcagtgc 


4860 


tttgccagga 


aaacagcaat 


acacagcaca 


acatagggaa 


tgaaatctga 


aaataagtac 


4920 


ctaaaattta 


ttccccatta 


caattgctgc 


atttacgatc 


tctcgaagag 


aaacttgaaa 


4980 


aaagtaggtt 


attctcctca 


atcttgaaca 


atacaaacac 


ttatcaagtg 


acagcaaatt 


5040 


gcagctatca 


aaaaaaactc 


taatacacct 


acaggtgaag 


ataccaatct 


ttttctctgg 


5100 


gaaaaaaagt 


tagcatacta 


accaggcaac 


atcaaatgta 


gctgagtttg 


gaacatcaca 


5160 


cgtagacaga 


ttggaatttt 


ttgaattcaa 


gtgaggcctc 


acctattttt 


aatcaaacag 


5220 


tttggtttta 


aaaaccagac 


tactgaagct 


atgttgaagc 


gactaccttg 


aaacacaaca 


5280 


gttataggta 


gcagcgatct 


tacctgcttg 


tcattaacag 


agtacttttc 


tatttccttc 


5340 


tttgtctgta 


acagcttgtt 


ctgaaacgga 


taataagaac 


agctaaagca 


ccgcagttat 


5400 


gagaacacat 


ttgtcttcag 


tatttcgaga 


agccctagaa 


acactctcta 


aaaacaagaa 


5460 


cattatcatc 


agctagcgat 


aacgctttac 


tatgttcatc 


ttagaggttt 


atgaacgtac 


5520 


aaaacatcaa 


ctccatgcca 


tagcaatgga 


agagttttaa 


ggcaagctca 


tcacaaactg 


5580 
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catgttagat 


tacactactt 


acaaaggcag 


ccatctctac 


tctgcaaata 


catcacattt 


5640 


aaattattag 


gatcacctta 


aagcatacag 


tggaaaaaaa 


accaatgagt 


aattatcaat 


5700 


tcccaaatta 


ctcctcaacc 


ataccaattt 


cattaatata 


aggtagccag 


caaaaataca 


5760 


aatacgctta 


cctcatctct 


tgcaatcagt 


gttatgaaag 


ctccttgctt 


ataacattca 


5820 


atagcaatac 


attttccaat 


tccactggag 


cctccagtaa 


cctggaagtt 


atttgaaaag 


5880 


tcagttagtc 


cgttattctc 


atccctaccc 


ctgtatcata 


acaagtcaaa 


tcaaaactct 


5940 


aaaagtgtac 


tgagattcaa 


atatcaccca 


gtgaagtatg 


aacacatgag 


atatacaaaa 


6000 


gtttctttac 


ttactgagta 


cgaatggaaa 


aatgaaatct 


aactaccttg 


tcaggcaact 


6060 


ctctctgtcc 


caaaaaccag 


ctttacctta 


atggtatttt 


ttccatattt 


cttatgtcag 


6120 


gctcacttag 


acccactctc 


acgtgccttt 


gattaactta 


tttttgaagt 


acggtaatag 


6180 


ttataagttt 


gcaatcacta 


ttaacattaa 


gaacgcaatt 


acatcctgtc 


cctccatgcc 


6240 


aggctacaat 


aaaatgcata 


aaaagagcaa 


tcatagcttg 


tctgccaaaa 


acaccgtgta 


6300 


aacaaacaca 


atttaaaaag 


ttatcttctt 


tcttattaaa 


taaacatcct 


taatttgaca 


6360 


gctggccccc 


tgcagagcac 


gtctcacagt 


tcacatctgg 


atgctgaact 


gcctcaatgc 


6420 


ccttctgtgg 


ggatcccagt 


tctaagcctc 


cataagcctt 


tccttcttcc 


tctaaaccat 


6480 


aactctcagt 


ggtctcccat 


gcaaacggga 


aataaacagt 


ttgcctgcag 


acaaatgcaa 


6540 


ggcgatggct 


acaacccccc 


actaataaac 


aggttaaagt 


cacctgaaca 


cccccctctc 


6600 


taccctccag 


aaaaatctat 


ttgtgaactc 


catttaaaac 


aaaacataga 


tggtgaccca 


6660 


actgctcttc 


ttcttatcgc 


aaataaaact 


gtcatcctga 


tactgacttt 


tggtgatttg 


6720 


ttcatttaaa 


cccagcctca 


gtttgtacgt 


gccttcagtc 


cttaaacaca 


cagagttgta 


6780 


ttttccttgt 


atggccaaat 


gtgttgaatt 


ttctcctaag 


agcgtatcac 


tgagaaaaga 


6840 


ttggaatgtt 


tcattgatga 


agggagttct 


gaaatacagg 


cttaaatatt 


tgagaaagag 


6900 


attcatattc 


ctctggaagt 


tgtcatttct 


gttctttatc 


aagacttcag 


agaaagtaag 


6960 


gtatcacctc 


tatttcaact 


atgagaaatg 


gaatgcactg 


agccaatcag 


tggaagagca 


7020 


aaggcttcca 


gtctcatctg 


cacgtgcctt 


tgttttgttt 


tcaggctcac 


taccctcaaa 


7080 


attctttcat 


attcctgggg 


gcagtaaagt 


caaaagagtt 


agaagtacgc 


tcccatgagt 


7140 


caccaagctc 


ttctttctta 


cgagtgtttc 


atcacaaacc 


ttcaaagaaa 


acagctcagt 


7200 


acacaaaggg 


tgatcatgta 


acctcagcaa 


acttattttc 


aacagattcc 


acagtttttg 


7260 


tctaatttgt 


ctgctgtcac 


cagtgggtca 


agaacttaga 


aagactaact 


tcaacttaaa 


7320 


cagagtttca 


tagacttccc 


ctattagaga 


aagtcgcttc 


aaagcagctt 


ctaatcgact 


7380 


ctttactacg 


ctaacctttg 


tgaatgactt 


taagatgcac 


gttctgttcc 


ggagatgttt 


7440 


aatgctgttg 


cattctgaat 


aaaggatggc 


tgcttcctaa 


gcaaattaag 


tctgtaagaa 


7500 


cgagtgttaa 


cgtttgggaa 


ccctagcatc 


atttaattct 


tcacaacaga 


aagaagttaa 


7560 


atttttagtt 


cattcccaaa 


gcacttttga agattccaat 


acatctctag 


aaggattagg 


7620 
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agattcagtt 


tgatgtattc 


aggcaaacct 


cagttccaga 


gcactcatga 


gctgccttta 


7680 


cagattttca 


gcagcagctg 


gtacatcagc acaggaacta 


atgaatgcat 


gtggaagact 


7740 


gctccaattc 


ccatttttgg 


cacagcactg 


aggcacagct 


gattgctacc 


tgtacaaatt 


7800 


gctaagtgtc 


atgaacaggt 


caactacagc 


agtgtttgaa 


ggtgaaggtg 


gcataatgcc 


7860 


tatggcttgt 


tagacaacgt 


taacgttcac 


ttcggaaaga 


caaggtctga 


agtcaagcaa 


7920 


gaaagggaag 


tttttatttg 


cagtcatacc 


caagaacgct 


agatagggct 


catgtcagat 


7980 


tggtcagtga 


caaaaagcag 


aatcagttag 


ctaggaactt 


cagcatttag 


gtggaaggct 


8040 


gttactaata 


cgttggaggt 


tgttactaac 


atgcaccacg 


agatgcaatg 


taacagtaaa 


8100 


gaaacagaac 


aactcatcga 


tcaatgagta 


agcacaattc 


caaatcatca 


caactatcat 


8160 


cagaagcgat 


caaaatgaag 


acttgatatg 


ctctttccaa 


cagggcttgc 


tagctttcac 


8220 


tgagcagcag 


cacaggatgc 


aagctggctt 


cttgctttta 


gacagtactc 


taacagtagg 


8280 


cattcagttc 


cactgaaaca 


gaacatcccc 


gtaacttgct 


ccaattggct 


ggtgtgcctg 


8340 


actggactcc 


aaatgacctc 


ttcctgaacc 


acccatgtgc 


atttaagcca 


caggtcacgc 


8400 


tgctcccaat 


gcaagagcta 


caaagctgag 


acacggagat 


gggtcaactc 


caggaacacc 


8460 


caaagcctgc 


tgtccttatc 


tgatgggctc 


tgcagggaaa 


tgctgaaact 


actcctagtc 


8520 


ctgagatggc 


ccttacacct 


gtggcttggg 


catactgaag 


cttaactatg 


tccaagttaa 


8580 


aggccctgct 


caaactgaaa 


ggaagacaat 


cacaggcagc 


aaatatcctt 


acattcctgg 


8640 


catagttagc 


ctatgaaagc 


aaaaatacat 


ttaaaagcat 


tttgttttac 


taaataagca 


8700 


aaaactgatg 


cttgagtcaa 


cctgatagct 


gtgaagtcct 


tcctatttaa 


cagtgcccac 


8760 


tgagattacc 


aaaacacaca 


gcagatatgg 


aacaggaaaa 


cacccccaca 


agattagatc 


8820 


agtaatttgg 


agttctgtgg 


agctccattc 


ctctctaagc 


agaagagaca 


cctgtggtct 


8880 


cggtcccact 


ctaaagcata 


cctcatgtca 


cacagcaagt 


gactccagga 


gaggacagga 


8940 


gggaaaaaaa 


ccccaacaaa 


aaaacaacca 


cacattttct 


gcctcagatt 


tctgtccttg 


9000 


tacctcaaac 


acctagctct 


attaaccatt 


acaactgtta 


ggagcagaga 


gaaatagatt 


9060 


ctacaaatga 


ttaagtaggg 


tgtgaaaaaa 


tgctgattta 


aattgagaat 


tcaacaagta 


9120 


actgctagga 


aacaatatac 


acagtagcgt 


ccaaacacat 


tcaatgaccc 


tctttaagaa 


9180 


atactacagg 


tattctcctg 


cacatatgaa 


gggggaatta 


tacatacaga 


tcacacaaac 


9240 


agattttgac 


tttttctgtg 


aaaacacttc 


aaggaagttc 


ttaacagtgc 


tacggtcctg 


9300 


aatattttca 


tgacaccttc 


tgataatttc 


agtagcagag 


aacactcact 


cccattatac 


9360 


gatgcaagaa 


ttcaccttgc 


acatgtgatg 


tgtacagcct 


tcacaagtac 


agtttctaat 


9420 


ttacaaaact 


atcatccaat 


gtttttcctt 


acacaattca 


cccaaacatt 


tgaaacccaa 


9480 


acatgcaagt 


ctaacagcat 


cagggttatt 


agcagcacct 


acatatcaaa 


aaaactaaaa 


9540 


ctctgaaaca 


actgctccat 


gtttcatgtg 


tttgcacaaa 


tgctttcaga 


gcataactga 


9600 


agccatagtc 


aaacaagtgc 


ttctgccaga 


agcaatgcca 


ctgaagctga tcacccacag 


9660 
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atttgtaccc tgatgccaat 


aaatacctac agcctttcct 


tctattggga tatgaacaca 


9720 


gttaatcctc 


cagaaggtgg 


tgacgcactg gaacaggttg 


cccaaggaag 


ttgtagatgc 


9780 


cccatccctg 


gaggcattca 


aggccaggct ggatgtggct 


ctgggcagac tggcctactg 


9840 


gttggcgacc 


ctgcacacag 


caggggggtt gaaactagat 


gctcattgtg 


gtccatttca 


9900 


acccaggcca 


ttctataatc 


ctacgaatag tcattttgag 


accatcactt 


atgtcaaatt 


9960 


caggttacgt 


ggctaataca 


attagcagta gtggctgtga 


gggaagattt 


ctccaacaag 


10020 


attattcttt 


gtcattttca 


ttgtgagcca actgaagtgg 


ctctttgaaa 


aaagaagaac 


10080 


cagcagagta 


gctttggaaa 


aagcgtaacg acacaaagaa 


aagacaacac 


tcgggataat 


10140 


cagattaaaa 


acaaacaggt 


ggacaatact ctgggataga 


atacactgaa 


cattttgttg 


10200 


cttactattc 


tgtttccacg 


caagcactgc agtaccctta 


ccctgcttca 


cctttgcttt 


10260 


tacacagtac 


agaaggattc 


ctgcttagca agagttaccg 


ctgtgggaag 


aacctcagag 


10320 


agccttcact 


cacgctctac 


tatctccagc aggacatgat 


gctgtaaagc 


cagttacaat 


10380 


acccagcaat 


acctattgca 


tcaagtaatt tgggaacact 


gttgcaactt 


ggacagctcc 


10440 


aagccgggac 


agctctatcc 


gcaaagagca gccctaaaac 


aaataggcag 


ataaaaatga 


10500 


acacaaacaa 


acaaggcacc 


acacagagct caaaaaaatc 


ccaaatgcca 


agcagctgtc 


10560 


aattccctcg 


cacctcagag 


gtctaacttc tgcattacac 


ccaagtcctg 


tagtaaccaa 


10620 


gtcctgtagg 


cagcctgcat 


gccctactca ccccccaaaa 


gcagattcag 


caagcaaaca 


10680 


gcacagctcc 


tctacacgga 


gcacaccacg ggtaactaca 


gctaagtccc 


cagagctgac 


10740 


tgaaggacca 


cagccgcccc 


gacccgctct cacgcaccca 


catgctccac 


acacacctgc 


10800 


aggcccctgg 


ctgctgctcc 


ccatcacgca cccgcccagc 


tttcagcggt 


acgctcctgg 


10860 


ggaaccgttc 


aaaagctata 


ttttcccgaa taaaccctcc 


caaaggctcg 


ctcctacaca 


10920 


gctgattaca 


gacaagccaa 


acgtcgctcg tggacacgga 


tacccgcgct 


gagtgccgcc 


10980 


tgaccgcttc 


ccttcgcgtc 


agcccgcccg tttcctcagc 


acgggtcgcc 


tttcagcccg 


11040 


tgccttccac 


cctcgtgagg 


agggccgcac tccagcacct 


cggcagatgc 


agcggggcct 


11100 


tccccgggac 


acggcggccg 


cggcctcccc gctccctcct 


cccgcccagc 


gccggcagcg 


11160 


gaccgctccc 


ccgcggtcgg tcagccagca gcggccggga tcggggtggg gagggggagg 


11220 


cgagaggcct 


cgttcgactc 


accacgacgt gtgcgccggg 


cagcttgagg 


ggcttggggc 


11280 


tgatgagcgg 


cgacaccatg 


tagaggagaa ggacgatggc 


cacgatgaag 


gcggccgcca 


11340 


ggagcagcat 


ggctcgggcc 


gggcttgccg ctggggaggg 


ggcggcgggc 


ggtgacaagg 


11400 


cccctggggc 


tgcgggaggc 


gccgagcgcg gcgcggcccg 


gcgcggcacg 


gacagcggga 


11460 


gtagaaccgg 


tgcccgcctg 


cgccgcggcg ccacggggcc 


acaggggagg 


gggaggagga 


11520 


agaggaggag 


ggaggaggag 


aacacggccg ccactccgcg 


ccctgattgg 


ctggtgggcg 


11580 


gggcgggcgc 


ggcctcgcga 


gcggggattg gccagtgagc 


gacgggaagg 


agctggcgga 


11640 


ttggccgaga 


ggcgggacgg 


cgctcggaga ttggagccgc ; 


aggctgtttg 


cagggtcacc 


11700 
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gttggaggca aagggcggcg gagaagagag agttcctccg gagaagacgg ggtgcgggac 11760 

ggcgccccct acggtccccg cctagggcgg gtgaggcgag gagtgaggta ggccccgccc 11820 

cttctcgtca cgtggggctt cccgccgaaa ggagggggcg tggctccgtg aggtgagaga 11880 

cgggaggctg cgggcggcgt tcacgctctg gaacgcgagg gcagctgttt gtggggaaaa 11940 

aaatgtgtag aagcgtggtt tcaaagcata ataataaaag attaactaaa acgaaaacgt 12000 

cttgcagctc aaataaaatg attccgtgcc tacgttcaat atttccttcg ctgtttatga 12060 

ctgaaaggaa ctccctgaaa tgatttatgt tgtaaacgct gtgcagcctc tgacttgtag 12120 

aagggagttt gcagcgtacg cggctttacg gcctcggaga gattacgatt acgggcgaga 12180 

gggcgtgcgg aagggtgtga agataatgca ggagatgaga ttggagcggg gagtaggcgg 12240 

aaatgggagc agggctgcgg gcagggatta ggcagtcgtc aaggggagag caaagaatct 12300 

ggaacaaaag aatccaggaa ttagttctgg aatgggattg agccgggatc gggcttcggt 12360 

gacgctttga gagtggtgct gtggggtgga ggtgtgagga aatgagagga agaaagagcg 12420 

cgtgctgagg taacagctgc cacggcaagg gtggggagag agctgacaaa gtggtgtgtc 12480 

caaggaaagg cggtgtggaa tcgtaggcat ccttaaggct ggaaagggtc accaagtcca 12540 

agcaccaagt ccaactagca gaagttggtg taggatatgg actaggaacg ctgcaagcac 12600 

agataccgac ttcattcttt gcatacaggg cagtgtatgt gttatctttt gtagaatatt 12660 

aattaaacac aaaggaggag attgataatg taatagagcc tatttatagt tatctagtgc 12720 

agaatatggc gagacttgaa aagcccaaat gtcagcagca tggagataaa gcagacggag 12780 

ataaatccat ctttcacaat gcgatatcgc tttcagaatc aacatgaggc aagggcatgg 12840 

ataaaaacaa tcatctgcag ttaatttcta gtaaaatgaa ggttaaacat gttggtaggg 12900 

ggcctctaaa aacctcaaat gcatgatatg ctcctgattg gtcacagtta ggatcacata 12960 

ttactaaata tttgagaagc ccttgtagat taacgaggaa tcccctcggt gaattttatg 13020 

cagaaatcca tactgtcttt tccttttagc taagtggcca ctttacaacc gtgtgattga 13080 

caatccaggt agcgtccact cacattttgt tcctggggca gtgaagtgtc atgaatttat 13140 

ctccaagaaa aacattcaaa agtgaagacc ttgtgaactg cttataactc accaatgtat 13200 

cgccacagca gtaggttttt gactcttttt aggtatgcca gcaggcactg aagtttgccc 13260 

tcctgagctg tctgctgtct ggtttgtatt tgtctcatgt gacctcattc actgaggaag 13320 

tgcgttcctg acacacggga atggtttgct acgaaactct tttctcagtg actgtggaac 13380 

tggaaattga accctaaaaa aaaaaagtgt tgaagccctc cagtccaaac tttggttgta 13440 

cataaagcag tatttaatta atctgacctt gattaacaac atcaaaaagt gtaattttga 13500 

agcacaaact gaccaaggta tgtatgtacc ttcgggatgg gtaagaaaat aaaaaggtta 13560 

acacatgcta attgctttgc taattaatcc ttagaagcag cttcaacaca acagcgatgt 13620 

gtttagagaa gaaaatcaaa tacaggtaga ttaaagcgtc caaactatag gaccagctgt 13680 

ggttttctgc ttcctcagtt ctgttcatat aatctttcaa cagacgtttg cagtaacaat 13740 
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gttgtgggtt gagataaatc agtatgaaca aagcatggca accgaagtaa gaaagtagtc 13800 

atttaaacac ggaaacaaat gtatgaattg ataatattac aacacaagtg actgatacta 13860 

gaggtgtcct tttgatcttc ttgttcccaa agcatacaag gtacacacag aagagacaca 13920 

ggctgtgtta agatgccatt aagagaaggc ataaaggttt gacagagcag gtagtgaggt 13980 

tgcagcctgg acagactttc ttattgcact tgagtactca tctgctggat tttctggttg 14040 

tgtcatattc acgttaggga gagaggaggg aaaaagagca ggatgcgtag gctactcagt 14100 

gattaaacaa aaaaaaaaag ctggaaactt cttcatgtga tttccatcca gtcagtcctt 14160 

ctgcttttag agaaagcagc atgaaggaaa aacttcagta gccaaggaga acaacttttt 14220 

ccttctgttt tcctgaatta acttactttc ctctccaacc ttctcccttt tgtgtagcaa 14280 

gcataggtgt tctatgctca tttcttaaga ggtctgttgc agtaatcatc ataagacatc 14340 

aaaggcatgt tggcagttct tggattcctg caaagcttca agatttagaa tgatggcagt 14400 

ctaggtgagt tgttcctggt caacaagctg tcttgatccc gtgtcccaaa tgagaagagc 14460 

taatagggac ataagaactg aaatcagaaa aggatttaca taacatgctg gcagtagagg 14520 

agaattgggc aagaaataat gatctgcaca tggtagtgac taaagcagtg tgactgaaat 14580 

acttatcaca cccagctgct tgccttgctg ttcttcccca aacaaacaag caaatccctt 14640 

gtagctgaac aatagcttct ttactggtcc atcacgctgg agagatcatc agctacccca 14700 

tgcatagcag ggtgaaacag ctcccagagc actgtgcagg tcaaagtact atatgtaccc 14760 

tgtctgctgg agtgctatca cggtgatctt ctgggtattc ctagaaggag atttcctgta 14820 

ctcccaagct caacgtatca tccagaaagt gctcgcctgc agcagggacg ggttctggcg 14880 

atctctgcag cttccagcta tgccgcatgc ccttatcgca atgaactcag gctgggctga 14940 

tggcccaggt gctggaggct gccagcacgc aggcaggagg tggttatagc agctcaggct 15000 

caggtcaaac caaggcttct tgctggggca gaggggactg actctgtggt gcaaaagcag 15060 

gtagtatata tatatgtata tatatacaaa gcccagctac cagctgagag tcccaaggct 15120 

gctgcagtag ttttgcaatg agcacacagg aaacaagaag atcgctgaga acactgctga 15180 

aatcagattt ctgtcttcac acaggtcaag ctgatttaac tgtttaatgt aattgctgca 15240 

gttgcttgga aaaaaaaaga aatagtaaaa ccatgtccaa aatgaaccat tcataactgg 15300 

tggcccatta tgtgtcacag ccgatgttgt gctgaataaa taactgtaca ggtattttat 15360 

atattgagca acatatttat tgaaacaaaa ataatttacc tcaaaccagc ggtaaaagga 15420 

agtctttact gtctaattta aataggcata agttaaactc gggactgaga tgatcttgaa 15480 

tttcatttgg tgcccatggt tctttttatg tggtacacct gcttacactt accatcacac 15540 

tggagcagtt tgcttttgcc acccgaatgt cagacactgc tatagattta cagtagcttg 15600 

ggggggctgc aggttggaag agggggttga ggcctcatca agtgccatgg caaaacaccc 15660 

tcaagtaagc acggctggaa gcaggaagga tgagggaatg agctgccatt tcctttgcgc 15720 

tggaaggatc actgctaaaa cttgtaaata tctgttagaa acaaacaggg acgttcactt 15780 
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tgtcctgtga tgcaagagca cccattctga atttttatct cctgcaaagt tgtatttaag 15840 

ctgatgttta ccgtggacgt tcgtgttaca agatagcctt tgatactatc aataacaagt 15900 

cctctttgat gaagtaaagc tacagagtca caaagcatgc acttgtctga ccctttgcct 15960 

ggctgcctgt ccaaccacgt tgcaccacta cacccagccc cacgagacct gctccagggc 16020 

caagggaatt gagcacttaa gggaaagtgc tttgtacaaa acatggcgct tatgagtttg 16080 

aaaacgtaga tccaccaaaa cctcctcagg cacgatgagt atattttttc tccactactt 16140 

acagcgctgt gaattctagt taagggcgtt ttgattccta aagaattttt ccttctaatc 16200 

atagacgtac tccagtcctt attccagaag gcttactcct tgtattttga aggtcttatc 16260 

ctgaaattgg gatgcagagc cattctgaaa atgacagtat tttaagactt tgctgcactt 16320 

actctggctt cccacatacc ttcctcttgc aaccttccac ctcccagaac tgcagcccag 16380 

cctatcctcc tctgccagaa aatcggatcc cacaggccct atctcacacc tcccggttcc 16440 

ccatcctcat ggcagctgcc ctctttccca aggcactcta tggagcagca gaactgctga 16500 

gtgcacaggg caaagatctg ccgttccgag agagcagaga agcatcgctc gggaatcact 16560 

gcactgctgc agcactattg tattctgcct ttattcagag gcagtccttc acctatgaat 16620 

atcactacta ccttactgaa tatatatttt caggaatatt ttcacttttt agccagatag 16680 

gaagcggatt ttgtaattac ccttccagca acttacagcc aattactgtc tctcctcctg 16740 

attcctgtcc agcaatttgg ttgcagttat tgcttctcca gagcgggcag aattttttgc 16800 

tttaggaaat gtacacctcg aggtaatctt tgaagagtga caggttctaa agttcacaag 16860 

tttgatctgc tttgggatta agctacctgc taaactacca cacgccatcc agtcaagcca 16920 

tttctattat gtgcgtatgg ctgattctta tcacaaaaga tcaagttaat gatttgcagt 16980 

cttcggcaag cctctggttt ctttgaactt gctttttgta agcgatattc tcgggtactt 17040 

tttgtgcttg tgaagctact gcagtgctct ggagattttc tttgtgctcc tggctgtcag 17100 

agttatccat ttctaggcct gcttggccat ccccatagca cggggagaac cgtactttcc 17160 

cattgccctt gtacctgcac ttgtaaaaac gctagaggaa ctgaaattac ttcaagttcg 17220 

tgccctgtcc tctttcaaag ccattctgag aactttcttt gcacaacctt tttacaagag 17280 

ttaaatccgt ttctagttcc aggcaacaca cttgtcatac acagcgctgg caagggactg 17340 

ctgtttattt cttgcttgga tgcaattaca cagccatgtg cccttgtttt cagtccctga 17400 

tccattatct ttggcattta ctgcaaagaa gctgctgtta cgcaatggaa atttagatga 17460 

tctctttttc ttagcttact tctcctctaa cccaagaaat gagtacagta tagcctgctg 17520 

aatgcaagga aacctgcacc tgcaaacttt tctccccact gcgtcactac caaatatgtc 17580 

agagttgctt gtacttctta agtctgtttc catcccctaa tggcacgaac cgttgccctc 17640 

ctgttgtcag actgcaaaaa ggccagcttg tacagatttg ccctgtaggt ttgaatggaa 17700 

gaagggaaaa aaatcagaga aactgccagc ttttgttctg ccgcttgtaa gcttgctttg 17760 

gtagaaaagt tgaagaaata ggaacatgct ttgaaatagg attttaaaag gaatcagctt 17820 
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cttatcttcc 


ctttgggaaa 


aaatagtgtg 


aaggacagaa 


taaatcagac 


ggaaaaagaa 


17880 


agaaattgac 


gtaagagaac 


tagtcgggca 


gaaaggagga 


ggtggaaaat 


acccaaaagc 


17940 


agcaggaaag 


agggaggcac 


aggttgccaa 


ttaacacttc 


gatcaaagga 


aaggcccgat 


18000 


caaaaccttt 


ttcctcctct 


aagaagcatc 


accccttccc 


actgcttact 


gcaatgaagc 


18060 


gagcttttag 


actaagactc 


aagagaataa 


ccccaatacc 


agtaaagcct 


gcagaacttg 


18120 


tttttttcat 


agctgacacc 


acagacaaac 


aaacaaataa 


ataaatagta 


gcgcagagca 


18180 


tcagcaccgt 


ggcagtcatt 


ccagcaatca 


cttccccacc 


gtgctctcct 


ataggagagc 


18240 


tgcagcacag 


gtcagcgtct 


cccaacccgt 


gcacttcttc 


acggacagat 


ttgcatcatg 


18300 


cagaccctca 


gattgcccag 


gaagaacaga 


actgcaatgc 


ccagaaagag 


tgtggaagct 


18360 


ctgagaattt 


atctgcctgc 


tggacagagc 


ccatctacac 


ctggaacaag 


cgggcacctc 


18420 


tctgtgctac 


cagtgctggg 


taaagaaagc 


tgtgcagcag 


ctcctccctg 


aacactggct 


18480 


acgttgtgac 


atcagccctg 


tggttcctgt 


ggcagctcct 


gcgcttctgc 


aactacatga 


18540 


gtctagctgg 


caggccacct 


gcttgtttcc 


atatcagcag 


cagccacgtg 


caccatgtgc 


18600 


accatgtgca 


gggggcctcc 


aggcaggtaa 


aaaaaacaaa 


caaacaaaaa 


catctcttaa 


18660 


ttacaggggc 


agagcagggc 


tggatacgaa 


caaacaaaac 


cataccaaaa 


caagcacacg 


18720 


tgtaaagagg 


aaaaaaaaat 


aaaataaatc 


acagctttgc 


agtttgtctg 


tcttcagagc 


18780 


aaatcaaggc 


tgtgattaat 


tcgttacaca 


tcagaactcc 


aagcaggctc 


aagctgagcc 


18840 


gttgcaactg 


gcattatgaa 


tggcacactt 


gaaaaacagc 


caggttgctt 


tccagattca 


18900 


tggaatcata 


tcatagaatc 


atagaacggc 


ctgggttgaa 


aaggaccaca 


atgatcatcg 


18960 


agtttcaacc 


cccctgctac 


gtgcagggtc 


gccaaccacc 


agaccaggct 


gcccagagcc 


19020 


acatccagcc 


tggccttgaa 


tgcctccagg 


gatggggcat 


ccacaacctc 


cttgggcaac 


19080 


ctgttccaag 


atgttagctt 


ctctaacatc 


ttaccacaac 


ataataatga 


aagaatattt 


19140 


aaaaaatccg 


tgatgggtag 


gaacttcctg 


gctgcagcct gtgctccagc 


cctcaggtgg 


19200 


tggaaggaaa 


taatcatttc 


tagttggaat 


tttcattttc 


tttttttttt 


cctcagcttt 


19260 


caagtaggca 


aacaattcac 


ttgtcttcca 


gagctcaaat 


cactgctgta 


agtaacagtt 


19320 


ttcatttgtc 


atttttattt 


cctctgtgag 


atggtgatat 


ttatagcaac 


attctcggtc 


19380 


ccttgcttgg 


atgacttgtg 


attgctacgg 


ttcttgtaac 


agcattgcca 


gaacagtagc 


19440 


aaaaggcaac 


tgctccagca 


ccggtttttg 


taagccatta 


cctgtagaca 


ctcatctgcc 


19500 


tacagtagta 


tgagtcagtg 


gaaattactg 


tttatagttt 


acagaccaca 


tgtgacaccg 


19560 


agcatgtttg 


aaagcaaagt 


ccctgccttg 


aatagctgag 


atttaaatta 


gctgaggcag 


19620 


cagaggagga 


gggagggcaa 


gcaaaagcag 


gtcttgccaa 


tccatggcat 


ggtgcctagt 


19680 


gataggtcac 


caagcaggaa 


agaaaaccca 


accctggctt 


cattatcaac 


atcaggccta 


19740 


tgctcaggtg 


cccgtgactt 


atttcctgag 


aagtctcaaa 


acacgaccaa 


cacctgtttg 


19800 


aactcctata 


agagagctta 


gcgcctgcta 


tgatgcaggt 


aggatactga 


tgtttatttt 


19860 
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cattactagt gcgtgacaca tccaaagaaa ttagctgtaa aatgtctagt attcctgcaa 19920 

aagaacgtaa cagatcctgc acgtggcagg taccatgcac agatggcacc aacggatgga 19980 

tgctggcttc ctcacacgtt gagttgttgt ggagttgctc tgatgaaggg gagcagcatt 20040 

tgtgagcatt cattcatgga gctggagtct cctaagcaag gtaacgaatg caaaggtggg 20100 

agtgttcaag tggcctaggc aggcttgggc agtgagccca ggtgaacctc atgaagtcca 20160 

acagaaccaa atgcaagatc tggcctctgc attgaggcag ctcccactac caatacaagc 20220 

tgggaaagga ctgagtgcag ccctgctgag gaagacctgg ggggtattgg tggttgggaa 20280 

gctggacatg agccagcaat gtaccctcac agcccagaaa gccgactgta tcctgggctg 20340 

catcaaaagt agtgtggcca gcacagcaag ggaggtgctc ctgcccctct actctgtgct 20400 

ggtgaggcct cacctggagt actgcatcca gatgcggagt cctcagtaca ggagagacat 20460 

ggacctgttg gagcgcatcc agaagaggac cacagaaatg ttctatggaa tgggacacct 20520 

ctcttacaag aacaggctga gagagctggg gctgttcagc ctggagaaga gaaggctgtg 20580 

agttgacctg atagcaacct ggcagtatct aaagggcagc tacaggaaag aagggaacag 20640 

actttttaag cagggtctgg tgtgatagga gaaggggaaa tggtttcaag ctcaaagagg 20700 

gaagatttaa gttagatata aggataaaat tttttacagt gaggatggtg aggcactgga 20760 

acccagcgtt gtggttgaag ccctgacccc tgagactttc aaggcgaggc tggctcaggc 20820 

cctgggcacc ctgatctagc tgtggtgtcc ctacgcactg caggggagtt gaactagatg 20880 

gccttcagag gtcccttcca actgcaaaga ttctgtgatt ctagtaaaca gaaagcgtac 20940 

agaacagtga cctagtcaaa aattgactat cggaagggcg tgtgggtaga ggtaggcagg 21000 

caaaactgta attaggtcaa agaaaaatga caggacaagc ttatctaata tttgggatgt 21060 

cagtagccaa atgccagtac agaggatgaa cagcaaccat taagaatttt ttacacaggt 21120 

aattctgaca acagagaatt tggggagtaa taattgaaat attattggta aaacggtatt 21180 

tttaaagaaa aatcaaggtg agagcacaat agctacaaca tagactaccc gctcaagaat 21240 

agaaggagca atgttttgat aataataaag tagctgttgg aaaagcagca aaattggaag 21300 

caaacagtcc atcaagtgct tgcaataggt tatgtaagtt gtgtgaatgg ctctaagtca 21360 

gccatgatta ctaggatgaa tctggttaag acaaacatgt atggaagcca accatgaaac 21420 

cacggtcatc attctggagg aaggaagaat tatgcagcaa aatcaaggca ttcctgcata 21480 

tttcaataat tcagagctat taaaaagctc cctgtcacga taatcttcag aaataatgtg 21540 

aaaaaaatac atagcggagc aaattttcat taggaagaca actaaataaa cacaaaaagt 21600 

agatcaaaca atggctcaac agaatattta aagcagtttc tttgcttcag ctgccaaaga 21660 

gcaaactacg atcaggtgca gctgactgat aggagcacaa aagctgattc aagggtatct 21720 

gcccagacga cgtgtcgaca tgttctgctc cactcattaa aacaaaagca gtcaactcaa 21780 

ctctgaaggc tagtagttga ataataataa taaaatcaaa accaaacaaa ccttaccaat 21840 

ctctaagaca gacaaaacca gtacttaaac cagggaaggg acagaactct ggattcagag 21900 
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attaatcagg 


tgacgtgggc 


aaagatacag 


ccagagaatt 


taatgagttt 


ttctaactct 


21960 


atgaaaatat 


gtgttgagaa 


aatccactgt 


tagtcaatgg 


gaagaaacat 


ctgtgaagaa 


22020 


caaagcaagc 


aagcacagac 


acaactgatt 


taaaaactat 


tttacccaca 


gaacataatt 


22080 


tttcatactg 


cagtcagagg 


tagcaatagc 


actagaagtt 


aggaaaaaaa 


cgtaggccaa 


22140 


gtagtcaaag 


actagtcaca 


gctggcagca 


tgaaagatat 


gcaagtaatt tatccagtgc 


22200 


ttagaggctg 


tggttatata 


aagcaaatat 


aacctttaat 


caactataaa 


ccaggcagca 


22260 


ttgtttagag 


tacataggtt 


gctctgaaag 


taacgcctcc 


tatttatttc 


cacagaaact 


22320 


acaactgata 


cgaagagcat 


aacaacactg 


atagagcata 


ttcccagcta 


caaaacacta 


22380 


cttttcaact 


cagtcatcac 


gattagctct 


gcatttttgc 


cagcgatgag 


tgagagcctg 


22440 


catgctacgc 


gcacagaaat 


ctgcaccagt 


ggaggtgccc 


caccgtcact 


ggtgctgaaa 


22500 


tgcaccaccc 


actgcctcac 


cgcgctcaca 


tccactgctt 


ggtctccata 


aatattcagc 


22560 


aagcattgat 


gaatgtcaag 


aggtgtaatt 


ttttctgtgt 


ggaggaattc 


agtgacacct 


22620 


ctgcttcatt 


tgcgcttcca 


gccaaatgcc 


attctgtcat 


gctgcctctc 


tgctgccatc 


22680 


tgtcgcacag 


ccagagcaca 


taatggaata 


ctggtaggaa 


ggttcaacca 


ctaccgccat 


22740 


accaccaatg 


acaccttggg 


ctgatgatat 


actaaaataa 


atactacctt 


tggggtagca 


22800 


ctcatagttt 


aggttaaccc 


tgaaaactga 


agtaaatgac 


acctccccct 


acctcccgca 


22860 


gccatttagc 


tacatcttct 


gggataactt 


agctaggaat 


ctgtgataat 


atttcaactt 


22920 


atcctcattg 


tagaaaaaca 


gcagaacaaa 


aggttatcct 


tagagctgca 


gttctaaccg 


22980 


gcagtgttta 


tttggaatat 


ttctaaaaac 


aacttgaaac 


accaaacatt 


aatgtttcca 


23040 


tttccatgag 


caagtagcga 


gatgcaagtt 


taaaacatac 


agtatatttt 


tctacgttaa 


23100 


agataaggga 


ttacacagtt 


aggtttaggg 


aaacccatag 


gacaaactga 


cctacgacaa 


23160 


caacaaagaa 


acgcatttcc 


tgagaatttt 


aagattgcca 


taaggactgt 


catataggac 


23220 


tgttataaag 


gtcaattaac 


aagtaattca 


ggcagtagct 


tcaatcctcc 


aggtgagagc 


23280 


cctgccagtg 


cgtggctcgc 


ttctgaagtg 


ttcaccagag 


gcaacagagc 


aaagaatcct 


23340 


gctgcaacta 


agatcaagtt 


tacaaaccac 


agtaacttgc 


atctacactt 


gaatttcccc 


23400 


cgcttgccca 


caaaggtcca 


caaaaagatt 


tgcagccccc 


tgaatcacat 


tcacattttc 


23460 


cagtgcgaga 


ccgaagtaaa 


gctgcaaaac 


tgaatgactt 


tggaaagaac 


atttcattat 


23520 


gttagcaaac 


aaaagctcag 


caccttgcag 


atcaaagaat 


ttgtatttaa 


gtgttttgtt 


23580 


tagctgtcaa 


acgtagaaac 


aaaagtctaa 


acaaaagtga 


tagttttgaa 


agtaacactg 


23640 


aagaaatact 


caagaacata 


actgatgttg 


tacattttac 


ttcatttaag 


tacagcaaat 


23700 


ttcacccatc 


ctatgattta 


tcgagtacgc 


aaaatatgta 


catagaggaa 


accaaaaccc 


23760 


ataaaaagac 


aatcatctat 


gtgcatatgc 


gcatgtaaca 


tatgcacatg 


aaatgtgcaa 


23820 


ttttctttta 


atgcaagtta 


aacaaagcat 


atgcacaaca 


gagttgcaca 


accattacag 


23880 


aacagagtgt 


tctgggtatc 


ctcatgatgt 


ttcgcatcta 


cagccagtgc 


aaacttacaa 


23940 
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ggcacaaact cagtgctgac accgtagtgt tgtaagttca ggcacatttc aatttgtagt 24000 

tcttaaagat aataatcaac agaagtgcta cttctgtact aaagtgccag cctcttccca 24060 

aagattaagc attaagttga tgtaacctgt acacagtaat gatcagcggc gttcggattt 24120 

aacctaacct atcactgcaa ggtctgtggc tatatcgtgc tatgcgctcc acacctctga 24180 

gggtatgctg cttcccaaaa tgcctccctc acactcttca aagactaccc atacctcgcc 24240 

agccttgacg cgtggactct tacaggttac tactcaatgc tttttcctaa ccttagccaa 24300 

acctctgata aaaccagact taaaaaatca gccatcggga aatctttcgc acacttgcat 24360 

ttaacaaacc tttgctcaat tgcatagtga catgtgtatc agctaggaaa gaattaaaaa 24420 

caaaagcttg ctgcttaagg caaaaatttt taacacagca cagcagaaaa agccaaatac 24480 

cgggttcatc agtatttaaa caaagcactg gctcatacag tcttctcctc acagtgtttt 24540 

cttccttact ttcacagcaa acacacacag tatgctcaat tagcaaattt tgttgcattt 24600 

ctctaaacgg agtgattaac acataggctg actgctactg aaaacacctg acaaatcgct 24660 

tctcttgcac cctcaaaaaa gggtttcttt ttgagcctac cagaagttga aaacccgctt 24720 

gcgcccaggt ctaatataac agctaaaact gatcatttaa aaattacaaa tatttaccat 24780 

gagttgccac atcactctgc taaaattgtg ttttccgtat tattttccaa tagaagacat 24840 

ttaatagaca tctgaagaaa acaatacaat ataaaagcgt aagggtcttt gcaaacagat 24900 

cttctattcc ttctgcaaag taagaaagga gagagtttat tggcatttat ttgcagtgcc 24960 

atcgataaag acacgagaat acttaagaaa gcaaaaagtt ctagtgatcc acagacatct 25020 

ttggcttagc cttcctgacc aaagtcttct gtaaacttct ttaacttctc caggtcctgc 25080 

tcattaactg ttggctttgt gctagctagc gacctgagca tatcggcctg tgagaaagaa 25140 

agcaagaaaa catgcattca gaaaacgtac cgctgctaac agtattgctg tgaagaaaag 25200 

taagctttga aaagcctttt aaaacaagtt gactgtagga actcttattg aaacaaaact 25260 

tcgagtaagc ctgaacattt ctgcacgtgg accacttttt aacctcctga cgatagacaa 25320 

ttagtgagtt tttacaggac ttaagccaca atctgaggtt cagctttaaa acaattcatc 25380 

cattcaacaa gtgttatcta ccactgctta ctgcaacaaa ctgagcttcc catcttacag 25440 

attcgtattc caattcactt ttaaggacat caggttgaag tggaaaacca tcacacgttc 25500 

ccacatattc caatgcccac caacacagaa tacttcatca ttgatctcca gcaaagtttt 25560 

actgctcatg actgctaact tctgtttctt cagctcagtc agttttgtat atttacattt 25620 

ggctactaga aaatggagtt cagaaaaaaa aaccacagag gtatgaactc aaattcagca 25680 

gttaagaaac cttattaaaa aaaaacgtat ataaaaagtc ctggccaaag gcaaagcgag 25740 

gagctgctca acacctcacg ttactataaa agcacagggt taagttaaag tcagcatcat 25800 

gattttctag gctttctcat ctcatcgtac tacagacatc ctacttagaa agaattcaag 25860 

tctgatcttt ttaatgacaa gaactgattc tggactctga aataagtccc tgtgcaactg 25920 

tagcacatca gagtctacct tccattagaa gcactgaagg aattgtattt aattccagga 25980 
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ttagtttaca caggcagaag ttttaaggca 




26040 




"t "tt "t c a "t a c c 


tcctccatgt 


gcaaatatgc 


agatatttct 


ctxctcaaac 


26100 


1* a nl~ n a "t a a t* 

Lay Lya i-yy 


tacatatara 


aagcagtgca 


ctctacttta 


gagggttttt 


gatccctatg 


26160 




tcctlztcatt 


cattacagaa 


acgtttgcac 


acaggaatgg 


ccatcagcac 


26220 


aaat"ct:aa"ta 


tcaaa"tcc1:t: 

L- \— H. %^ L- V- V* L- L- 


ccttcagaca 


atgcaattac 


attcagaacc 


ttttgctgct 


26280 


"t a a a a a *t a. a. a 

y y y wlalala 


atatacgagt 


gctcaatgat 


ttgtaacctt 


ttaaacaatg 


tatttaaact 


26340 


tcaatttct c 


tcaaatatga 


tgttttggtc 


tgtagacaga 


agcaaatatt 


ttaacatata 


26400 


caaaaaattc 


caqctqaatq 


ttagcaagag 


ctggctgcat 


catctgtgat 


gaagtataat 


26460 


ccaaactacc 


attgcatcca 


ccagcttttt 


acattgcatt 


ggttatgctt 


gcatttcttt 


26520 


"tataaacaaa 

uy L yyy , - uwu 


atttacctac 


agcatgttat 


tcccagttta 


cactgaatat 


aatttcccac 


26580 


ttctcaatat 

L. L* ^- LJ LA v- LH L~ 


caataataat 


gctacagagc 


aacaggaaag 


taacatatcg 


taqaacaaaa 


26640 


a"ttc"taaaaa 

LA L- L. LA LA y 


"tttitiaaa'tga 


ataaaagaaa 


aattaaagaa 


gggaggaaga 


ttcaaatqcl: 


26700 


a"tcta*tacta 

%-j L_ \«. L- LA v. vA w ^4 


catgccacta 


gacaataata 


aatgcttatc 


agggatggag 


agctggctcg 


26760 


Ly tA. n y 


tattatatta 

i_y Ly i^-UL ty 


tcatgctgtg 


tgttgcgatt 


aaaatgtcat 


ccagtaiigtic 


26820 


caaaca1~at"C 

L__ LA LA L4 LA \_ LJ L- V— 


taaaaacaaa 


gggctcagcc 


aattgccttg 


catgctggct 


ctaaaatgtc 


26880 


t_ v. y a. y lu l l i_ 


i_ Vw C4. y y y l. v. v. i_ 


gcaaagcaag 


aaacaccacc 


aaaaaataaa 


aaaataaaaa 

LA *_A LA LA L- LA LA LA LA LA 


26940 


raaatarcca 

\— C4. CA C4. \- O V— V- U. 


ccataoaaac 

V — , L. LA L» LJ L-4 LA LA LA V- 


tttaggctcc 


agtaatttat 


cccctggaac 


aizccatccat: 

L*L W V_ L LA L» LA L* 


27000 


atcatttctt 

y L- V— LA I— L* L» L- L- 


caactlzcaaa 

LA LJ V_- L- L_ V— LA LJ 


atcacctgga 


gagcaaggag 


tgaacaaatc 


taccatgata 


27060 


tttaaattca 


"tcactgatigg 


tccttttacc 


taataaatga 


atacataaat 


aaataaaata 


27120 


aacaaactaa 

U U. L. U U. L- V-J LA 


agctgaacat 


ctttagagca 


aaagcatact 


cttaattttc 


tgtacatgcc 


27180 


"ccaccca"t"tti 

V- \— LA V_- V- V — L_ L- L 




agtgaagtgg 


aattgtgtaa 


aggtgctggc 


atcgttcact 


27240 


ttgaaaacgc 


acagcagtag 


tcagatactt 


gaactcatac 


catgtcagaa 


ccaatgagcc 


27300 


tttaaaataa 

L_ L- LU.LIM y L- LA 


aaatacttat 

25 LA LA L- ^— L- I— ^ 


agaaagctaa 


tgtgccaggt 


ctactgtttg 


gagaagacca 


27360 


ctctcttctt 

L- V- L- L- L- V*» X— X- 


agtcctcagt 


cactttggga 


gtccattcac 


cactggttaa 


catttc taaa 


27420 


aaattcticaa 

LA LA LA L* L. L_ V-< 


t:aat:t:at:1:ac 

L» LA LH L« L* LL W L« LA V* 


tgactgaccc 


tcaagttggg 


ctgccatggg 


tgtcctttta 


27480 


agcttccact 


cactgcacta 


aaaagttccg 


ggcacctttt 


ctgacacaat 


ctctaacagc 


27540 


acttgataga 


aaataaaacc 


atctagtgga 


ggaacagaaa 


ccatcccttc 


t"tccagatac 


27600 


a*taaacaaaa 

LA L- LA \A LA V* LA y *-A *-A 


cctoaaaaac 

V- \— L. LA LA LA LA L^ V- 


tccatcagct 


gcctcttatc 


tttttgcaat 


gcatatctca 


27660 


aacctataa "t 

L^ LA V— L»» 1— L^ *— ^ *— 


tctaccatcc 


ttcctttgtc 


agtcactgaa 


gtatcacaca 


"tccccatigaa 


27720 


cacaaaacac 

L— LA V- LA LA LA LA. V- 


atacaaaaac 

LA L. ^4 LA LA LA 


gaaaaaagaa 


ctgcttttaa 


cagcagagaa 


ctggatttgc 


27780 


tgtttcaatc 


tgcttttaaa 


gcacagcgaa 


gaaaagcatg 


gattataata 


ctggaaactc 


27840 


aacttggaca 


aaccgctatc 


aataggctgg 


aacaagcaat 


gggttacagt 


gagttacaga 


27900 


aattgagcaa 


aacgctacaa 


acaggaggca 


ggggcagatg 


gcgattggga 


caagggggaa 


27960 


tagtttaaac 


caacagaggg 


gagatgtagg 


tgagatgtta 


ggaggaaact 


tcttactcag 


28020 
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agggcagaga 


ggcgctggca 


cagctgccca 


gagaagctgt 


ggtgccccat 


ccctggaggc 


28080 


gcccaaggcc 


aggttggatg 


gggccccagg 


cagcctcagc 


tggtgggggg 


cagccctcac 


28140 


catggcatgg 


ggttggagct 


gggtgggctt 


tgaggtccct 


tccaaccccc 


aaccatccca 


28200 


tgattctatt 


taactgggac 


aaactgctac 


tatggaaata 


gttaataaag 


caaaggtttt 


28260 


tcttataaaa 


ataagaatct 


gcatccaatt 


aaagcacaaa 


caaaacaagt 


ggaatagact 


28320 


tgcatcagaa 


cactcaaagc 


acggtaggct 


ttttttcctt 


tttggcaaaa 


gaggtaagaa 


28380 


ttgcctttgg 


ctgctctgca 


aactgtggta 


actgagatta 


tttcattgtt 


ctgtggcagg 


28440 


ctgaggcacg 


cctcagatgt 


ctgcaaattt 


caatgaaagg 


ctaaaatgtg 


acaacccatt 


28500 


ggccagaaat 


gccatcattg 


tataaaaaca 


acaatggata 


aatacttcag 


gcatcactgc 


28560 


ttaagggaag 


gaataaccca 


gaaaatccct 


gatatatcaa 


aatagccgct 


tattttttaa 


28620 


gcaaatacag 


tttacaacag 


ctcaaaatac 


tgtttcaaaa 


tgttctttga 


ttttaaactg 


28680 


ggaaaagttc 


atcaaaatac 


ctaccaaata 


ttcttcctca 


ccaccaaaat 


tacagactgc 


28740 


tggcgtattt 


taacaagttg 


ataaggcttc 


ctcactgcaa 


gcactggaac 


tttaacagat 


28800 


ctcttacatt 


ctgaaccata 


ttgtatttaa 


gcgttccttt 


cccttggtgt 


cttaagctga 


28860 


atgtgttcct 


tacaattaca 


tggagaaaag 


tgcccacctt 


cagttcacac 


tgactctagc 


28920 


tgttcagctg 


agggctctgg 


atgagttact 


ggtaaaaaac 


taagaaactg 


tcatcataac 


28980 


tcatgagcaa 


caactgctgc 


caacacaagt 


tgcgtgtatg 


acacgcagag 


caataaaatg 


29040 


aaagctctga 


aagcttccct 


ttccagagtc 


aaaagtccct 


gcagataaca 


agaatccacc 


29100 


ttcacctgaa 


gtttgtgaat 


ttctgtgaaa 


acaaagtctg 


cagtacaaat 


gtaaacagat 


29160 


tattttagtt 


tcgctctcta 


aaaccaaaac 


aacagcaaga 


aaaaactaga 


caagaaaaat 


29220 


actatcatgt 


tatttataaa 


atgtaggcga 


aactccaaga 


taagcaaaaa 


aaaaaaagtc 


29280 


ttatctatct 


atagttacac 


tctttttaga 


catcaactaa 


gtgtaaagta 


gttttcactc 


29340 


tacagcagca tccataagat gttccttgct 


gccccagcaa 


tgacaacgac 


cttactcagc 


29400 


cgtcttgcat 


cttaactact 


gtgacaagta 


acattagggg 


attcaatttt 


ttactggaat 


29460 


cttaggataa 


tcttaatttt 


acagtttgaa 


ggacatcctg 


agcaaacagt 


tgtgcagttg 


29520 


taattcctct 


gttcccacgt 


agataaggaa 


tacgtttatt 


tacacacatg 


cgctagaaaa 


29580 


acaattacgt 


aatttgatat 


agaagaagag 


caccactgta 


agactccgat 


ttaagttgaa 


29640 


ctccaaaccg 


aatgctttta 


acagcagtta 


tagacgtgaa 


gattgattag 


agcttggatt 


29700 


acacaacatg 


aatacctaga 


gatgaggtgc 


atcaacttat 


ggcaggagta 


ctcctttggt 


29760 


aggtaatgaa 


gaacagcata 


cacacatctg 


taagcacacg 


gtattacccc 


aaaccgaact 


29820 


tggcttactt 


acaacaagtt 


ttcagatcaa 


gttaattctc 


agagttgaag 


caatatgaaa 


29880 


aacgttttgt 


ttttacttac 


ttttttaaag 


tgagtagctg 


attgcacttt 


tctaacaggt 


29940 


tgcatcagtg 


catcgcgtac 


aatgatgctt 


atatctgcac 


cagaatagcc 


atcggttctt 


30000 


ttcccaagct 


cccgataatc 


tgcttctgtt 


aggagattgg 


gagtcgaccc 


gaggtgaagt 


30060 
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ttgaacatgg cagccctggc atggtcttca ggtaaaggaa tataaattcg cttctcaaac 30120 

ctggtttcca aaagataaaa gcactggctc acgcaggtgc acgatggaaa gaagtttatg 30180 

caaatcagta tatactttgt ttgtaaatga aactgctttt ttcttatgta ttataaatgt 30240 

ttaaaaatat atatctcaga tattctgcag cctgttctca taagtaatac catggctatc 30300 

ataagctaac atctacaatt taacaacgac ttccttttta tgacagaaag tctcttcaga 30360 

ctgtagtttc tccaggttca ctccagagaa gtttgtttta aaagaaaata actgaaggaa 30420 

aaggagtctt ttagttttta agtacatctg aacagttttc atagaatctt agaatcgcta 30480 

aggttggaaa agacccacag gatcatccag tccaactatt cacccatcgc caacggttct 30540 

cactaaacca tgtccctcaa cacaacatcc aaacattcct tgaacacctc caggctcggt 30600 

gactccacca cctctctggg cagcccattc cagtgcctga tcactctttc agagaaatag 30660 

tagtggtttt tcacactcaa agaaagagct gcccgataac acgttcacac aaccagtttc 30720 

taaagtttgt aagtagagaa cgttgtagtt ggaaacgaat ttgaagtctt actctcaata 30780 

tagttgttgg taggaatggt tgatacttgc ggtgcttcct ttgaagcatc tgttctcaaa 30840 

gagaggacga cctcccatca gggaaatagg accgactcca agttctgtag aacactatta 30900 

acttcctata ggtaagtggg cccaagccat gaaaaattaa ttctgttact gccacgctct 30960 

acaagctcct ttaagttttt cggacaagaa tgagagatac tcgttcacac tgcaaagaat 31020 

gacttgaaat gttaagtacc acattcgcct cttattcctt gtatgaaact acacatgcac 31080 

aggatggaag cgacctctgg aggccacatg gtttaaactc cccagtcaaa gcacggtcga 31140 

gttgaattaa gtacatcgat aaaatgacac tgtcaccaaa aaggattgtt tctttagcct 31200 

acaaaaatta ccattataca ggttgtatca tcatcacaac ataatcacat ttgtcacgta 31260 

actgtgtttg tcctttgctg ctctgcaact gaaagatcca gctaatcaga tacagataca 31320 

aacgtcatcc cattagagaa aggcagttga aacgtacact gaaagatcac acaaactgtg 31380 

tgaccagtac agcaaaaaca atgcttctgc attacttaaa ttctgtgaaa ttactcaagc 31440 

tatccaaggg tttgctaaag ttgaaaacga tagctctgct gcctcttacc cttctgactt 31500 

gcttatgttg taccttgccc cccatgctca ccaggagacc agtcagcaac gaaacacaag 31560 

ttttttgctt agtcaagtgg aattagctga ctaagagatc agacagacta caagatatac 31620 

ataagagaga acaatccacc acttaagtga aggggatatt tgactcagtc cacctcatga 31680 

gacatgcctg caagaatcaa gtggatcact cactcaaata gcctcaggat gaaccctcac 31740 

aatagttgca aatttcttag cataaacatg aatacatcaa tcataggcca acataccttc 31800 

tcctgatagc agaatccaaa acccagggta tgtttgttgc tcctaagacc aatattcctt 31860 

cattatcaac accaacccct ataccaagga agaaatcatt tcaccattta gaaaataaac 31920 

agagactgcc tgataatgtt ttagaacatt tacaaaacgc aagggggtaa agctgcacat 31980 

cttttcacat gtaagcaatg cattttatgc gtagctgaac tcctttgatt ctgaaaacta 32040 

ttaaacttac cttgcatctg gactagaaat tccgttttaa tccgtctagc agcctcgctt 32100 

Page 16 



WO 2004/094640 



PCT/US2003/039244 



Chicken ovalbumin Locus. ST25 
tcattttcac ttcttgaccc acatagtgaa tctatctcat caatgaagat aatagagggc 32160 

ttgttttctc tggcaagctg gaataggttt ttcactaatc ttaaaaaagg aaacagctgc 32220 

agttatctta ttgtacacac aagcaaaaac atgcaacttt ggattatgat acagtgactt 32280 

tgttaagaaa aagctaaaag taaaaaataa aatgaatccc acataagata ttaacaaagc 32340 

tactcaaaga tacaacatcc cttcagaact actaacacag cattaggctg agatgctgag 32400 

tgagatacca cagaataagg taactttagg cttcctagtc ttgttaacac atctcattgt 32460 

aacatgcaga gtggatatat caaaggcgct catcacttcc aacccatata tgcccatctt 32520 

ttatgtcttc aagattttgt ttgaaaacag aatgtagaaa aaaaaccttc acacagagga 32580 

agaaacaaca tgtattatct gcagggctac tgcaacagat gagccagaag gtgacaagaa 32640 

tcaaagtacc ccaacacttc agaccacttt gttgtacaat cacagctggg ttcagaaggg 32700 

cattgatcac cattgtgctg ctaatacctt tgtccaaact agttttaaaa acagtcttga 32760 

gtgctgaagc tgctgtagca caaaatacag tgcattatgg tacttttacc tgacactgca 32820 

ctgaagcaaa gaaacatcta aggtttgctt taacaagaca catgaacctt ccttccattt 32880 

aatttcttta gagtgtccta tctagctctg aaaaattaat ttcctcttga taatattttc 32940 

ctggaactct ggaaactcca acttacttct cactctctcc taaccacttt gagaccaggt 33000 

cagaggaaga tactgagaag aatgtggaat tgttcgcttc cgttgcaaca gcttttgcta 33060 

gatacgactt tcctgttcct ggaggtccaa atagaagaat ccctctccaa ggtgttctct 33120 

tccctgcaag aaagaaatca gctatcatca aaatgctgta tcaagagcaa gtctatcttt 33180 

ctgatgaagc ctccctaatg tactaagttt tctgtatgta cctaagaaac acctgtcaga 33240 

tcgatcattt acagctcagc tggagcctct gatatagcag cataatgctc ttctcagact 33300 

ccgcttacac tactcacttc aacagcagta tttagaatgg gaaataaatg ctgtaatact 33360 

gacctgtgaa caagtgtgga aatttaatgg gcaagataac tgcttcttta agagcttctt 33420 

tggcaccttc aaggccagca acatcactcc atttcacatt tggtcgctcc ataacaatgg 33480 

cacctataag aaaagattgg ataaatcact gatacgtatt tttccactgt ttgcttacca 33540 

tatatttgaa aaaagaaatc cacgtgtatg tttacattaa ataaaaacga gccatttcca 33600 

cacagatttc agcatcaaac agtgctactc aaatggatat tatttctaca gagatttggc 33660 

aatctttttt tctttaacca caataaacca tcaataagca gagagttgtt agaagttctg 33720 

cagtgtgcaa actaactctg caactgcgca gaaaacatac caatggcaga tacagaagag 33780 

tacacttcct aaaaagagat caacatgacg tacaccctga tgaagcaggc ccactacagt 33840 

aggatgcaca ggaaagcatg agcaaacacc ctgctgtgag cactcagtgt aaaaagaaag 33900 

cctggagtag agaccaacat caatctgtat tgcatccaaa ccagaagagg caaaaaagtg 33960 

tctcactaag ttgcagaaaa tgtgaacagt tcacacaaga cggattactg tggagagagt 34020 

aaatatgtgc actttttatt ttccccaata tgtcaccatt acaaaggaaa atcatggaat 34080 

ggtggagggt gatggaggcc cagcctgggg ccccaataca tgcagcaatg gacagtgagg 34140 
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tcaccgacca agcggttgtg atgtcagcaa tggaaatgac tgtgtcctcg ctagccctca 34200 

ctgtacagat ttgggatctg gcagaggcca gcgtgtactt gtacctggac ttctactgag 34260 

catagctgcg agactcggag cactgagcga gttggttgag ttgtgctgtg gggctgctgg 34320 

cagcagttct tggtgcccac cccacagtac caccaacgtt tcccccagcc ctgcctgtct 34380 

caggcagctg gggccacaca gggtgcactt gtagcagcag aggtgagtgg tgcaggacat 34440 

ggcctctgcg gcggctggtg gggaagtggg agggtttgct gctgagggac caggacatca 34500 

cagctgcctg cccatgggac gagtgaccat ggcctctctc tctctttgca gttcgtaaca 34560 

ccttctgcct gctgcagtac ctgtgagggg agcagcttcc cgacctcagc tctcccagcc 34620 

caccgcacag cccggggcca tggacgtgcc atctaactgg acctgcccca tctgcgggca 34680 

aattcgggag gatgtcacct atgtgacccc ctgcaaacac cagctttgct acggctgtgc 34740 

catctggtgg gcaaacaaga agccgagttg tgccgtatgc gggcaccaaa tcaccaccat 34800 

ccgatactcg gtgaggtcgg acgacgacta cctcgagtgt gctgtcccgc agcccgcagc 34860 

acactctgat gatagcctgc aggatgagca ggggcctgca gagccggtgc tcatcccacc 34920 

tgagcacaac ttccctgccg aggtctgggc tgccttcttc aaagaacatc agggagacct 34980 

cgagcccctg ctccactggc tgcaggagga gatccaggag gcgtccagca gtgactggtg 35040 

ggaggtggaa gtgggacagt ggaccactgt caacttcctc tgcgagcacg gcctggacga 35100 

ggaggccttg atgcgggagc tgcagccgat cactaacggc gatgtgctgc cctttgtaag 35160 

gcagctcatc agcaccgcta cagccctgta cggcccagcg atccgccgcc agctcgacca 35220 

ccaggaaggc cgtgctgcag gacagcggga ggacagcccc gcagccagcc ccagcaccac 35280 

cacctcccat caggagcctc ctgcctcggg cctgggccac tccaccagcc ccgcagggcc 35340 

cagcaccgag gagctgcccg gcagctctac tgggggaccc gggcacccca gcaccaccac 35400 

cgcgccctca gcggaggagt cgcaggagga gccatggcag gcggtggcag cgggcccctc 35460 

cgcccagggc agggaccgct cgtgtggggg gccccggcgc cccccgaaga ggaaggcccg 35520 

cagcagcccc caggcctcgc ccccacctcc caaaaggcgg ccccggcggc ggcgctaggc 35580 

tggcaccgca ctgccgtcag agcacagcgc cagcgggctg ggaggccaac atctacctct 35640 

cggcctgctg cttgctggca gaataaacat cagttaaaac aaagaagaaa atgtctctgt 35700 

gttattgaca agactcttgc tgttgctgtc cctacccatg ctgctttctc tctcttccgg 35760 

tcctagagga gagaaatgca actttatttc caccatcata attcagcatt catgacagta 35820 

ctaacaaagc acacataggc tccaaaaagc cgaagatgga cccctcatgt tgctctaatc 35880 

ataatccaac caccaggact tggctaaatt cctctcctat tgccaagctc tgggccacag 35940 

attacttcgt ttgattttag ctgctgagct gtggtgtccc cctcccttca gacttcccgt 36000 

tagtcagtct gaagataaaa actctgttac cagatgactt ttagatggga cagctcacat 36060 

ctgagctagt gacccagctg cacattttga aaccctactc aagacaaatc caaaaggcaa 36120 

gagaaatctt cccaaatgaa ttaatgccaa ctaccccaat gcttatcttt ctgtactcaa 36180 
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gcacggtgaa ctgttcagtt gccatttttc tctacaaagg gctttctatt agttcacaac 36240 

cagtttctgc tagctatttt cttgtcactt tccccttgtg ccttcagagc tctgtgaatt 36300 

ggttgatggc cattttctac aatggaaagt gtaccgctac tcgtggctaa caaataaagc 36360 

aagtgacatt tgttcacttt ttgtccatct ccttagagat ttttactttt cctgcacgcc 36420 

tttctcatca gatagaaagg aatatttttt gcttgcaatc tatatacagg aatccagcca 36480 

ctcactttta atgccctcaa tacttttgct aggttgatta caactcagtt tttcctgtaa 36540 

ccaggctcca tcactaaatt aattagtagg acaagtagga acatgagatt agttccaagc 36600 

tatcagttat gtggacctgg catactgtgg taatttaaat tagcacactg taagacatta 36660 

cccataccag gaaacaaatg gaacaggaca tcgatcatgg cttcctcatt ttgtaggtgt 36720 

aaaagaacag ctggaagact aagccaacag agcgcaaaag gtctttaaat atcaagctaa 36780 

gccacttctt ttctatgtaa aaaactactg ctagctgcta tatattgcat cactggatgt 36840 

gtacagcacg ttatttcaaa aacacaaaca attatgttac tcaactgagt aacacccctt 36900 

atcactgcaa cacgaggaaa tcccgcctgt tgctatgaac aaacaagaat ccatcttccc 36960 

gccttatcaa cttgagttca agccttcctg tgaaaatggt cctgcttata ctacgtactt 37020 

ggatgacatc tgttacttgg atgacatcta ttgcctctag gcaataatat gtcaatgcac 37080 

ataagagtaa aactagcaca gtctaacaaa atagctatct gggatcttgc aactactccc 37140 

tttggaaaat gttttcttga taaatgatcc aatttcaaca tatgcaccac tgaatttcat 37200 

ggcatgcaaa cccatactgt cataaagact gtacttctgg atgtaaagag tatatactag 37260 

ttgagccacc taaagacaac aagttaactg gcaaaacaaa caaacaaaca aaccccccaa 37320 

acaactagaa attcacttga ccaaagtcac ctctatttaa ataaatggag gcttcaaagt 37380 

taccttgaag ctgattctgt agtttctttt tctcaggatc ctctgactct ccttccccat 37440 

cactgtcatt cctgatttgg aaacaagaaa taaaacgttg aaatacactg agaaactgct 37500 

gtcctaggtc acaaatcaga aagcaggaag tagaaaaaac atcacttcga ggaatgaaaa 37560 

accttatgat tttagatttt ttcagctctc tacaagttta catccttgta gtcttgtttt 37620 

tctacactat attctaaccc ccccctctca ctgcaaccat ttcaacttct gtacagaccc 37680 

gagcccttcc tcttaacaca cttctacatg tgttgactca gcctctagga aacaaaagca 37740 

tcgtggaagc agcaaaatgg cttcactgta gatgctggca cttactcctt gtccagaatt 37800 

gcaactggtc ttggtcaatt ccatttagta ctacgaaact ctctagtctt gtcagaataa 37860 

aggaaactgg aagttaaaag tagaaaaaag tagacgagct aggggacaaa tggaatggag 37920 

acgtgtagcc tcatgtttcc ttctactata aaccagcaga acacagtaca gctcaaaaaa 37980 

ataaaaccca tgaaatgagc agacaatgaa agaagctgaa aatcagggga ggtttcaaga 38040 

gacaactgag cagttctagc tgttcaagac taccaaaaag ggcaacctca cccagagaca 38100 

ccattgtgaa acctttcttc taccctagca aatacaaaag aggctctgct tgttcaggtg 38160 

ggctgattca gctctcagat gtgcaaagtg aaaaacagat gtaataaaag gagagcggtg 38220 
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cataggcagg agcaaggcaa tagagcgatt cagaccatca gaacatcagc ctatgacaga 38280 

accttggaac ccctcatcaa atgtgagaca gataaaactc agaccacagt aatcatccaa 38340 

accaaaccaa ccaaatctgg acttattttc tagtcattaa gtattttcat gcagaagaat 38400 

tgtgttacta ggctcactgt catcgaaaca aaaagtatta gtgtaaaaca gctttcattc 38460 

ttcagtgaat gtcctacaga agcattgaaa gatgtagcaa acaagcacaa aaaagcccat 38520 

aatattaact cacattattt ttccttttta agcccactgt ccttcagcat tagtagttac 38580 

ctgaagcgaa gcacttcaaa aacactactt aaaatgatct ctgttgagat ctaagttgaa 38640 

tcttagaata agcggagttc aggaagtatt ttgctttacc ttctcccaaa acataccctt 38700 

ttccatcggc aggaccagac tctttaactg gctttggtgc agttttttct ctctttttca 38760 

gatattcttt cagtttttct gctctgtcca agtattccgc acatttcact ctaatgctct 38820 

gttttgcttt atcaccctgt gtttcatcta agagtgtgaa aagaaacaat gcgttgttaa 38880 

caacaaaaca cacgtgcatc attcagaaaa catctttatg tgttatcaag atacctctct 38940 

cagggctcac cacgcatcca aatgtttcat ttacttattt tttcccctat gccatggaaa 39000 

gaagtgacag gaaagaagtt aacgcctaca aatcaatggt aagtaatcac tttcaaatca 39060 

aatacacacc tgaacgttgc tttgccttaa aaacttgcct gaacacgagt aaggacagtg 39120 

gcactggaag ctttttctgt cagtctctca aactgctata tagtgtctta actacttttc 39180 

taaactaagc cattgagagg ctgacttctt gtttttagag accttttttt aatctaagac 39240 

cactttattt ttccccggcc tgctaatttt gaaagttgtg cacatcaaag gaagaaaaaa 39300 

gtcacaaaac atctgaaaaa atgaggagtg gtccaacagc cacagttctg ttagtcgcta 39360 

ctgcagtatt ccagatcagc aatcaagctt gaaaatatta agttcatgcg ctacgttccc 39420 

aaaagtccat cagtatggtt aaaagcatag ggaagtaagt ggcatgagtt aatgagcaca 39480 

aaacaacctg tggatactac taagagttct tacaagaagg gagcaggcat gcaatatgca 39540 

acttttgtcc ttgctataat ataacacctc agccaaacta cagagagcaa gtgtcaactg 39600 

acaacaacag tcagaagtta aacgttgatg tcgacagagg agactactcc gggcaatata 39660 

aaacttgact tcatcacccc atgcattaca cttacattta acaacgtgga ttaaatactg 39720 

cacagcatgc tggtacaaac ggaaggcttc ttcatagttt cctgctttat cttcttgtgc 39780 

tgccttacta gcgaggtcta tcgctttctg taacataggt aaataattca aatgagtgtt 39840 

gtgtgagtgc tttgtgcgat caaagaggtt tttaagctgc tgctctgacc gcttcttggt 39900 

ggccagcttt tctgctcctt gatgtttacc caaaagagct gctgttattg aagacttgct 39960 

gtcagttgtc ttcatcaaat cccatcggca tcagtgttga tactggaagt acacgattac 40020 

aaagcaatga aagcagcacc ctttcccttc tgacccagtg ccaggagttg gtttcaaaga 40080 

ctcattattt ggtaagcttc tcatgaaggc tttaggtact tgacgtacag aagtgagaaa 40140 

ttctaaccat ctcttcagtg tgcatatggg ggggagctca gtggacagga aacataccta 40200 

aattatcaca gaagttctat caaggacaat ttagagatgg atttttattt gtttgttgag 40260 
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ataatttcaa atacatctgg tcgtaatcta agacactaca tcggcctgta gatatattga 40320 

tattactgtt attcctttga tcccgagtgc tttttattac atttcagatt acattacaga 40380 

tttttattac atccttggaa catccgtact gcttcaggac aattaagaat gacaattcca 40440 

atgactaagg cacgtatgct taaaaaagcc agagttgact aacgctacct cgaacttcta 40500 

cagccctgtc tgcatatttc caccttctgc cagtttattt cccaaaggca gggacagcgt 40560 

gctcgtgatg actgtgctaa catcagggag caaggtgaag atattcaacc tcatcacagg 40620 

gttttcacta cacactgctg tgcacatact ctcaacagta accagacgct ctgatgcatc 40680 

tcagtcaaaa ccgagcagat aaactgcagc catcagagaa ggaggaacaa catttctcct 40740 

tctattgttt tgtcttgcct ttttggaagt agagatcacc tcattggatc catctgaaat 40800 

caagagtaat ttatttcaaa acaatcacct gacaagtaag actatggatc ctttgtgaca 40860 

agtgttgaaa acagagcaac catctgtttc tttgaaacag aacttggtct ttcctcactg 40920 

ctgacctcgt gctgccctct acaaattcat tgtagagggc aaaccattca aattcagcac 40980 

aacaaaaata aattccaagc aataatttct gttactttag tgatttaatt accacaggaa 41040 

cagtccaatg attcctggat gcagaacaac aaaaacaggg ctatgacaaa aatgacaata 41100 

tatccaaaca acaaataaga gttggacttg atgatccctg tgtatcactt ccaacccagg 41160 

acattctatg atgctatggc tctgtgttct aaatggcaaa gaccgcctct gttcaatggt 41220 

aactctctta acagggcatc ttagagccct gctcctctga aatacaaaaa caaaggtcta 41280 

catcctgtgc tgactgtttt tggtattttt tcaaataaaa acccagaaaa ccatcacttc 41340 

ggttttagac tctcagctct ggtactttat tacattagga aggctcttag cctgctactg 41400 

caatgaaaaa caccagtaac aaacaggaaa taatttatga aagttgtatg aaataaggca 41460 

tagctgtaac cataaatgag gcacaacctg tatctatggg gctatagttt gagagctgga 41520 

tgaacaccac cctcagaacg aacatcggct ttgctcttct gcttactctg ggccctctga 41580 

tttcacagaa gggcgcaggt tggaagggac cgtaaagccc attcagttcc aatccccctt 41640 

gcatggtcag ggctacaccc caccagctca gtccgcccag ggccccatcc aacctggcct 41700 

tgagcacctt caaggatggg gcaccacagc ttctctgggc agctgtgcca gggcctcgca 41760 

accctctctg agtaaggagt ttcttcctaa catctaactt taatctcccc tcttttggtt 41820 

taaaaccata cccctttgcc ctacctctat cagaccatgt aaaaagtcag actccctcct 41880 

gtttataagc tcccatcaag tactggaagg ctgcagcaag atctctccca gcttggtcac 41940 

tataagcact acatagcctt aagcttacag gcatggacat ggtttaaata ggtttaaaac 42000 

tactttttgc acagattatt cctggatcta ttttgaaccg gcaacacaag cagttcactc 42060 

ccacaaccga aggctaaaat aaaataaaat aaaataaata ataattaaaa aaaaaaaaaa 42120 

aaggaataga gagcagacaa gcatttccaa gagtcgtact ctcagcagaa acccagtcca 42180 

aactacgcct ccagctcaca gcaggccgca gtcttgcctc agaggccaac gggtcttctg 42240 

gtcccagccg ggcaggtgac tacccggggt cctccggcgc ctccgagccc ccacccaggc 42300 
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ctgctcgacg ccccaccgct ggtgtcagcg cttctgcccc caggcccagc ctggcgcccc 42360 

accccgccga gcccgccctc ccacccgccg gctgcagcgc accggggttc aacaggaccc 42420 

gctctacctg caagttgccc gacatggcgg ggagccggga aggggaagga cacgagacga 42480 

cactggctac ggccgaccgg agctgccctt cccgccaccg ccgcccaccg aaccgaaaag 42540 

ccggccttcg ctagccgctt ccgcacctca gcgccggccg gcccgcttcc gcttccgggc 42600 

agcgccccgt acgcgtcact tgacgtcagc acgccgcgcc tcgccccgcc ctatccgagg 42660 

ggctgagcgc atgcgggccg ggcgccggaa gcggaagttc gtgggttggc gcgcagcagt 42720 

ggtgctgagg gaatgggggt ggtgttaggt ccagcactga cgtaggggat agggctgaga 42780 

tctgatcatg acctactgtg gggagcctgc tgtagcagag gttgggctgg atgctctcca 42840 

gatgtccctt ccagtccctg cgattctatg atcatttctg taaaatgtta aatagtcact 42900 

tatagggttt tgaataaatc acgttttttc ctcatgcctc acgtttggga cacaaagaca 42960 

tttttttctt acatctcttc tttctcgtac catttgcttg ctttcagcgg cactgtcttt 43020 

tgcataatct gagtgcagaa tgctttttat tcacagaacc agctcttaat aattcctgac 43080 

agtcataagc agtcaggcgt tagtcacctg cagctcagta atgaaactca actaacaggt 43140 

ctgcagagta agagcaatga cgtgactcag aaagcacagc acattgtaaa caactcttgt 43200 

aaacttgcta tatgggtttc agactaatga acttctgcta agtcggtgca acagttgtgt 43260 

taaattactg tcatatcctt ccctatgtta ttgtaatact gttgaggaaa tgcttcctta 43320 

gattcacaat cctcgttttt ctacctgcct ccaactaagc ccagtacagt ctgctctggg 43380 

atgaaggtaa aaggcacaag cacagtcagc cctatatcta ggaaggttga tgtaatttct 43440 

tcctaaagtc ctctgcttgg cagcttgttt tgcttaatgt cttcatatgt gcacaccagg 43500 

caggatgctg aaggctcgtt gtttggggat gatcagtaac agctgttctt ctattgcaaa 43560 

tgtgaaaggg tacaatgtag caaaaattcc tggatgtaat caggctctgg gaaatgagaa 43620 

ggcaaaggaa atgttggagg taagagcagc gttcaggaac cagaatgata tgggttggaa 43680 

gggatcttaa agatcataga atcatagaat cgctaaggtt ggaaaagacc cacaggatca 43740 

tccagtccaa ccattcaccc atcaccaatg gttctcacta aaccatgtcc ctcaacacaa 43800 

catccaaatg ttctttgaac acctccaggg tcggtgattc caccacctct ctgggcagcc 43860 

cattccagtg cctgaccacc ctttcagaga agtagtattt cctaaagtcc agcctgaacc 43920 

ttccctggcg cagcttgaag ccattccctc tagtcctacc actagtcaca cgagagacga 43980 

ggccgacccc cagctcacta caacctccct tcaggtagtt atagagagca ataaggtctc 44040 

ccctgagcct cctcttctct agactgaaca atcccagctc cttcagccgc tcctcataag 44100 

gtctgtgctt cagacccttc tccaactttg ttgccctcct ctggacacgc accaggctct 44160 

cgatgtcttt cttacagtga ggggcctaaa actggacaca gtacttgagg tgcagcctca 44220 

ccagtgctgc gtagaggggg agtcatcttg ttccaaccct gttttcctgt aggtagtatt 44280 

tctggctgtg ccatctgtac ctatggtttt caaatctgta atgctacacc tagcttttag 44340 
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acctaggtct aaaacagtac acaagtcaca ggcatgttag taatgcctct ccagtcacac 44400 

tttgcagtct tccgaaactc cacatataga catgtttcta tgattgtgaa tgagattaaa 44460 

aaaaaaataa attaataaat cagaaaaggc acgtgtatat ttacagataa caggctaaat 44520 

attatacttc ttaattaagc tttactatac agtattcctg ttatgtgact ttgcagctag 44580 

ttttgcctaa ggaaatactg gctgaatgct gagtaataac atcacgacag actcctgagg 44640 

agctaatgaa gtattacacc aagagtgtag cttcagtttg agagacgtgt atggtcacat 44700 

tttggaatgc ttcccattgc tgagttgctg tgttacaata ttctcaaaat ccgtgtcagt 44760 

tattgtgttc aactgagtgt aatgacaata aaatatatta atgacgttaa atgaagatat 44820 

catagaatca tagaacatcc caagttggaa gagacccaca gggatcacca tgtccagctc 44880 

ctggctccac acagcaccac ccaaaattca aagttgatgt ctgagagcgc tgtccaaatg 44940 

ctccttgaac tctggcagct tggggctgcc ctgggcagcc tgttccatac ccaccaccct 45000 

cttgttccct cgggctctgt cgcagtcaca cagagcagag ctcagcgctg cccctccgct 45060 

ccctgcgagg agctgcagcc gccaccaggc ctcccctcag ctcctctgct ctgggctgaa 45120 

cagaccaagg gctctcagct gttcctcata cacgttgccc tccagatcct tccccatctt 45180 

tgtggtcctc ccttggacag tctctaatag tcttatgtcc ttatattgtg gcacccaaac 45240 

ctgcaccctg tgctggaggt gcaactgcac agcacagagt agagaggaca accctttcct 45300 

gcactcgatg gcagtgctgg gcctgatgta ccccagggta tagttggccc tttgggatgc 45360 

tagggcacaa cgctcagtca cattcaactg tctgtcaaca agtacctatt ggcctgcatg 45420 

aggcctgtct gctaattggg actctattaa atcacatcac tgtgacacta ggtggcacag 45480 

gcacacatga tctccatgtt ccttaaggct gagtgaatca tggagaatgc ttcctgctat 45540 

cagtttttgg catggaaaga gaggagccaa accaccggtt ggttcaatgc cttgtgccag 45600 

gaataggtga atgcatcaat acaataagtc acgtctacag cacagccagg cctcatgtca 45660 

gcaatactgc tccactgtga tagctgaaag tgactataaa tgactaacgt tagtgtggga 45720 

ctttggtgtt agatgacgtg agagccatgc agtgaaagag aattagtgtg gcagagtatc 45780 

taacagtgca ggtagataag gcaggaagga taagtgtaag gaaagataag gagaaaggca 45840 

ggaaagtaaa acctctgttt ttctctagtt ttctacctgg tgaaatgatg aagaaagatc 45900 

agtttgacat aggttaacaa aaactgtcag taagaaaggt aggagttaag atgcatgttg 45960 

tccaaatccc actacattac tttgaccctc ttcagcatat gcacaatgag atcacttgcc 46020 

caagacagga cctccagtgg gcatgaaatc tgaaaatcaa ttatttgcta tttgtgttgc 46080 

ttatcatttc cagatgaaat tctacacgag ataattagag tgatgtcctt gaagatcaac 46140 

ctttttgtct aattaaggta tttgctatag cttccagatg tattgcttat ctatgataaa 46200 

tatccttcct aactacaagg cttctataat aagagtaacg tcctctatag taaccagtag 46260 

aaagtaggtg gaagctgggt gttcttagac aacctgtgcc catacatgga caaagtgagg 46320 

aggaggacac ctccctaaat gaccaccaga gaccactgaa gacccacatg caagcacaga 46380 
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agattcagat gtgttggtgt aaccttgtaa acgcagtaat ctcgtgaata tgtgatagat 46440 

aggtgtgcct tatgtattag ataggcgagt attgagaact tttggtttat ggatgtggat 46500 

agtgctgtta tccatcttgc accctgagca taaataaagc aatatctctt ctatagtgcc 46560 

ttgtcttttc attgtatttc aggagacttt gaaactgaca acaggcatgc agcttgggag 46620 

tgctcacagt cagtctggcc acagtgcctt caagcctccc ctgcactggg atgtggtgtg 46680 

acaaaaagca caaacactgc ttttgtagaa gacccagacc acaggctgca ctagggaacg 46740 

tgtctgcctg gagcacagtg ccctggggag tgctgctggt acagtagtcc tggatgagtg 46800 

gcttccttct gtaacctttt aattgcacta gaagtacacc agcatggcag agaagggctg 46860 

ggtcctaaga gcccttcttt caaattcact cagaactcca gatgtttagg cagggtgttg 46920 

tagctgtaaa gtccaggaag aaaaggttta aagctgtact cggcaccaga aagactggag 46980 

ccaaaataaa gccacattgc acccatggca ctataggcaa agggtagcct tggggcaaga 47040 

cttgatgtac tagaagttga ggagtcctca gactctgtgt caaggggatg tgccacaact 47100 

ctactgtgcc cctacctgaa gcctgaatca gtacaaatgt ctcacgcatg ggttaggcat 47160 

ccttctctca aagctcttgg tctttgcaca ctttcttctg cagctgcagc agcagccaaa 47220 

ggaaaattag gtcttgcttt gaaagccagc cccttccagc catgactggt cccttctcac 47280 

tccacatctg tggatgatgc tcccacagca ggtgggagag acagaggctt tcttgaagaa 47340 

acccagcccc tctaggggaa cactgtaaag tcacagggga ggagacgtgg ctttgagaca 47400 

gtgatatact ccatgcccct ggcgttcttc ccctgagtgc cactggtgct gctcagtggt 47460 

cacatgccac caaagtctgc attcatcttt aaatgctgct gagaattcaa cctttgataa 47520 

atcatctgct ttgacaaaat cgacatttaa aaattaatat ttcctcttcc atcccctact 47580 

tttacaggct ggctcaagaa aatgggaagc ttaatgtaga cttgggtctt actaaaccat 47640 

ttcactggga aagacattca cagtctgtgg cagatggtag cagtatattt tctctcatag 47700 

tacaggaatg ggtctggtag tacctctttg gaaaggaaaa tgtaaactca tacgttttga 47760 

gccaaattcc atcagatttc ttagttttgt tagttttcac tccactcctg ctggaaactg 47820 

aaaatatgga aatgcttgga aatttactgt gatttgggtt caggtgtgtg tatgcaggaa 47880 

atgtgttacc ttccagagta agtcagttta ttctagaaat gggatgactc cacttttata 47940 

cacttgtaat tcacagtgag attaatccag ccaattggga aaacagcctt tcttaaattg 48000 

tgaaaaacat gctccacttc tatgtatttt ttaatatact tcagcattgt gaatttgaag 48060 

tttttcttct actgttacat gcattccaac agaatttgtc aggaacaaaa atgaaatctg 48120 

aaataatatt tttcttagct ttgcatgtgt tatcctcaag ggtaatcact gtcctaaaca 48180 

acatacttat ggctgtttct gagcctttct tcttcatgaa ctcatcagaa agggacactc 48240 

atattggcag tctgtataga gagccaagga caaatatttc gcctacgtct tctctgcgta 48300 

gcattttata tattaggtct tgctagtgaa ttatgactga atggaataca gtcccttcag 48360 

tgatgacttc attcatgatt gaataaatgt agcttcaggg ctgtatggtt gacttacatc 48420 
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atccaatttt gccatctgca acagccaaca cctctaccca tatatgaatt cagcgaggga 48480 

ttttgtacta tgtgttgctg ggatgtagca gcatttctct ttgaaatgtc tttacagatg 48540 

caatgcctag caggcttaac agccctacct gcttcagaga cactgctgta aaaagaaaaa 48600 

gagaagcttc ccagccagta tttcatcaag ttaaaaaaaa tctaaaagtt tatactgtac 48660 

catttggatt gctgcatgtt gacatcattt aggattctga aaacctaaag aagctttgga 48720 

gcaactccta agtgtatggt agatgctctc attatgtaag agtgacaaat cactaccagt 48780 

cttccaaaaa tgcatgctga aatcaaaaaa gaaataatgg atctcacaaa actggatctg 48840 

cagatcaggt tctacagcct ctggtatgca agggttaaag tagagtgatt gttgtagctt 48900 

gtgtctcaca gtcagacata aatctgtaag caggtccagg ttttgtaaat tgttgcttat 48960 

caccacatga gcaataagta atctgaacac ccaatgtaac agatttctag gagttagggc 49020 

tgaaagcatc atgaagttta ttcttttcta cagcaaagca ggctctgtgt acctgtctag 49080 

ccacattgtc tctgacaaaa tttatcatca attctcatct ccatcaactt ttaagaatta 49140 

cagaattgaa gggagggatt gttgaaaggg atctctggag atcatctagt cttaccccat 49200 

gatgaagcag gttccttaca ataggtggca taggaaagtg tgagcaaaca ccctgctgtg 49260 

agcactcagt gtaaaaagaa agcctggagt agagaccaac atcaatctgt attgcatcca 49320 

aaccagaaga ggcaaaaaaa gtgtctcact aagcttcaga aagtgtaaac aattcacaga 49380 

agatggatta ttgtggagag agtaaatgtg tgcaattttt attttcccca atatgtcacc 49440 

attacaaagg aaaatcatgg aatggtggag ggtgatggag gcctagcctg gggccccaat 49500 

acatgtagca gtggacagtg aggtcaccga ccaagcggtt gtgatgtcag caatggaaat 49560 

gactgtgacc tcgctagccc tcactgtaca gatttgggat ctggcagagg ccagcgtgca 49620 

cttgtgcctg gactcccgtt gagcatagct gcgagacttg gagcagtgag cgagttggtt 49680 

gagttgtgct gtggggctgc tggcagcagt tcttggtgcc caccccacag taccaccagc 49740 

gtttccccca gccctgcctg cctcaggcag ctggggccac acagggtgca cttgtagcag 49800 

cagaggtgag tggtacagtg gggaagtggt ggggaagtgg gagggtttgc tgctgaggga 49860 

ccaggacatc tggacagctg cctgcccatg ggacagcgag tgaccatggc ctctctctct 49920 

ctttgcagtt cgtaacacct tctgcctgct gcagcacctg tgaggggagc agtttcctga 49980 

cctcagctct cccagcccac tgcacagccc ggggccatgg acgtgccgtc caactggacc 50040 

tgccccatct gcgggcaaag tcgggaggat gtcacctatg tgaccccctg ccaacaccag 50100 

ctttgctatg gctgtgccat ctggtgggca gagaagaagc cgagttgtgc catatgtggg 50160 

caccaaatca ccactatccg atactcggtg aggtcggatg acgattacct cgagtgtgct 50220 

gtcccgcagc ccgcagcacg ctcagatcac ggcctgcagg acgagcaggg gcctgcagag 50280 

ccggtgctca tcccacctga gcacaacttc cccgccgagg tctgggctgc attttttgat 50340 

ggacatcccg aagacctcga gcccctgctc cactggctgc aggatgagat ccagcagttg 50400 

accagaaatg ggtggtgggc agtgtgtgtt ggacagtgga ctgttgtagg cctcctttgt 50460 
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attttcggac 


tggacgagga 


ggccttggtg 


caggagctgc 


agccattctc 


tgatgctgac 


50520 


ttggtgccct 


ttgtaaggcg 


gctcatcagc 


accgctgcag 


ccctgtacgg 


cccagtgatc 


50580 


cgccgccagc 


tcgaccagca 


ggaaggctgt 


gctgcaggac 


agcgggagga 


cagccccgca 


50640 


gccagcccca 


gcaccaccac 


ctcccatcgg 


gagcctcctg 


ccttgcgccc 


aggccgctcc 


50700 


accagtcccg 


cagggcccag 


caccgaggag 


ctgcccggca 


gctctactgg 


gggagctggg 


50760 


caccccagca 


ccaccaccgc 


gccctcagtg 


gaggagccgc 


aggaggagcc 


atggcaggcg 


50820 


gtggcagcgg 


gcccctccac 


ccagggcagg 


gatcgctcgt 


gtggggggcc 


ccggcgcccc 


50880 


ccgaagagga 


aggcccacag 


cagcccccag 


gcctcacccc 


cgccccccaa 


aaggcggccc 


50940 


cgacggcggc 


gctaggccgg 


caccgcactg 


ccgtcagagc 


acggctccag 


tgggctggga 


51000 


ggccaacatc 


tacctctcgg 


cctgctgctt 


gcagataaaa 


tgtggggatt 


caagaaagaa 


51060 


tatttagagc 


acaagctgca 


gaacaagata 


aacagcatgg 


gaaaggaatg 


ctgaggacag 


51120 


aggatgcctc 


caagagagaa 


gaaagtcaag 


tgagctgcat 


gatcgctgcc 


taacaatcct 


51180 


aattggaaga 


agagtatgtg 


gctaggaatg 


actcataact 


ctgattggag 


aagcgcctgc 


51240 


atgcgtggtt 


aaggagtaga 


acaagagcaa 


gggtgaccct 


gtgggatgtt 


ttgttgacat 


51300 


gtaaaggggg 


tgggaaagat 


accagagaaa 


acttggcagt 


gtatttaagg 


gatattagaa 


51360 


tatgcaataa 


atgatttgga 


ttgctcatac 


atctgagtcc 


gtgccttgga 


tgctgcaaga 


51420 


aaataaacag 


aaattcaaaa 


aaaaaaaaaa 


aaaaggataa 


gaaaatgtct 


ctgtgttatt 


51480 


gacaaggctg 


tgggcgttgc 


tgtcctttcc 


catgctgctt 


tctccctctt 


tttctcctgg 


51540 


aggtgagcac 


agacatgcag 


ctttatttcc 


atgaccataa 


attggctttc 


atgacagcac 


51600 


taaaaaaaca 


cacgagggct 


ccaacaaaca 


gagaaaggaa 


cttatgttac 


tctaataata 


51660 


atccaataat 


cagggcttca 


ctaatttcct 


ctcatactgc 


cagctccagg 


ccacagataa 


51720 


ttaagttttg 


tttgatttca 


gtgactgagc 


tgtgatgtca 


ccctctctgt 


agacttccta 


51780 


ttagtctgat 


gttaaaaaca 


ccaaaaatat 


gtgctgtaat 


ccaaagagaa 


attatgggtc 


51840 


ccattaaatt 


ggtactttgg 


gttctacagt 


ctctgttatg 


caagagttca 


agctaaatga 


51900 


ttgctgtagc 


ttgtgcacga 


gttttgaaaa 


gataccaatc 


tgtgaacaga 


cccagatttt 


51960 


ctttctggaa 


ttctcctccc 


ctgtgcaaag 


gaaagcacat 


tgttttttgc 


tctcatcaga 


52020 


gagtactctg 


aaatgaacat 


ttttgagtta 


gacagtgagg 


agcagaaaag 


aaattctatt 


52080 


cacataggtg 


cttttaaaag 


cattaccaga 


ttcttctaga 


caaatgacag 


aggaataact 


52140 


tttgccattc 


cattacacaa 


tagaataact 


gaactgcaaa 


acaaagagtc 


acgctacagg 


52200 


agtaagtttt 


gaaactgact 


tgcttacctc 


tgatgcttcc 


agctgacttt 


ctccattctc 


52260 


acagtagatt 


caaagttctt 


ttttttttaa 


ctgtgtgact 


gtagagagta 


gtgttcacaa 


52320 


cttaactgca 


tgctgtgcag 


tctgaattag 


agctgggggt 


aggtgataac 


acacctcctt 


52380 


caactgtttt 


gttttcctga 


actgtggttt 


gtctcattat 


tttcttctaa 


atgctatttt 


52440 


aagcagtaag 


agtttaaaca 


tgccttctgc 


ctgccttaga 


actgcagaag 


accttaaatg 


52500 
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cagaactctt actgttcttg aattcatggg aaggtctgag gaaatggggc catccaagat 52560 

gtcctccaaa caatacgttc cctcattcca ttatgtgtaa ggtacagtgg tgttgtacca 52620 

gggggtgagc actgcagtgg taagtgctgt tggacctgtc gtgcaagaat agaaagaagt 52680 

cccacaacag ccaaagtcca gtggctggac caggagtagc caactatgtg gctgctgtga 52740 

tttgatccac accagatttc caggttagca ttttcctctt agacccatcc ttattaatcc 52800 

ctaagccttt taattagttc ttgtatggaa agtagcagaa actgtatagg aagtcattta 52860 

tctttctctt catcctagcc actcttacca gagtaatttt catcttaagc aggaagctct 52920 

tcaagccagg ctattattcc atcataaact gtctataatt cttctacacg tatgacattt 52980 

tgtctacatc ttccaatatc tgtctcacta acaagcctgt ttctgttttt tatccacaac 53040 

ccatcgaatt tggtagccat ctttgcagtg ggctttggat cttgacccaa gaaaggaaaa 53100 

cggaagggta tttgcactgt cacaagttcc tatagaccta attgcagctt tccaagtcac 53160 

ttatgcctgt tatgtaaatg ttaacgctat tgtggagttt attaactcgc tggattatgc 53220 

atgaagtatc ctctggagtt tccccatcaa gcttaatggg accattagat ctcagaagaa 53280 

tgacgaaagc tatttctcag tagcttacat attacctggg tagatgtaat gggaaagaga 53340 

aaaagaagca ttctgttatc aattcctagc actttctttt gttaaatata ggctattttt 53400 

tttatcattc acaatttttc ctacttttcc tttttttatg gcctagtatg ttctgtgctt 53460 

tgttacacaa atctagggat cctgggttag tggtgatatg agctgaatca gctgctgaat 53520 

gtaggaatag ctcacttgct ttcatgggtg ctaatcagtt tacattagct gaggttcagg 53580 

gccattgttt gttaagattt acatctggat gtcaagatgg gtttgcaggt ataactttta 53640 

taagtgactg gtgagacagc gacactgtag ggtgttttac ttcgagtaat gcagaagaat 53700 

gtacactgat tttgttgcct ttagcagatc tgccaaatac caactgaaga agcaagaatt 53760 

aacatgtttg ttcctcgtct tagttgcatt cagggacaag aaaagctcca tcctctcctg 53820 

aaaatacaca gctggagaaa attcagacca tggaggcaga cccatttcca gtgtctattt 53880 

cagcaaatat tgactctaag cttttattgt cctttaatat gcatatattc atgctgtgaa 53940 

tctatgctga agaactctgg gaaggtgtgt gccctcaccc acattaatca cccagacact 54000 

tcataactac ctggattaca ggagaaagtg actcatctac tgatgacgct ggataaaagc 54060 

aagaggggaa agaaatcccc agttctcaca cctccctcct ctgcacatag taaggaggca 54120 

ctcaagggca taatgcaaac ccagagctgg aagggaggct gtgtgtcagg gcccaggcct 54180 

gctgctgtgg gcagcaaagg ccatgtaatc gctggacatg tcagttccta ctgctccgat 54240 

ctgaaaccag ttcagagtta gagggagagg tctgcttggt ctctctgcta ctcatgggaa 54300 

aagcacttct gccaatgtga tctactctct cttcaaagtt tccacattgc ttacgtgagc 54360 

aatcctatcc ccatgcaggc tttcttttgg taggtgagcc cctgataatt cgaaaatcac 54420 

acctacactc agctaggagc taaccagatc tataatcaag cacagtatgg tgtggctatg 54480 

tagaggaagc atcttcaata aatgtactgc agtggtaata tgcttttaat aaggcaactc 54540 
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tgttcacatg aacagtacta gagagaagca caccaggccc tgaaacttca gggcaaacaa 54600 

aggtttgaaa gtatccctga attaaaataa ttgaggaaag gtgacaaacc taagcatgtt 54660 

tgggtttttt tctgagacaa gcatcgtgta ggttgttttg agccctagtc acagcctggc 54720 

aaaaggaacc tggtgcagtc actatgggca ccagagagga aggaagaaca gtgttgtccc 54780 

tgtccttgtg agaaggagct ctgaaccaca gtgcacatgt gtgggggttc acaatactgt 54840 

cttcctggaa ggactggatg cttcagtggg aagtgattaa tccaaagcac tgtctctctg 54900 

catctgattt atctgtgcca taccaaggcc agttatgccc agtgctaaga gttgggagca 54960 

atgtctttag ggaaaaggtc agatgccaaa tgatctgatt ccagcacttt catttcatct 55020 

ttcatttcct gttctccagg ttaaaggcct tgttctcact gaaagctggc aactgtttgg 55080 

ccgcctgtta tttcagagtt gttttatcat tattattatt ttcctgatga gatgtatatc 55140 

ccaaacaaga acaggctcaa taaaataaat gaatgaaatt aatttcctgt ctttgattag 55200 

aaatattcac tggtgaggca cacttctacg tcagcagaca tgtctgcaga aggctgagtt 55260 

cttgctggac gtgttgaagc agtgtgttgc tgtgtgacac catccctcat ccatctcagt 55320 

gcagatgcct tgggaagaaa gaggaaaaaa gagaggtcag cttgctgctg ctcagcttgt 55380 

gtctctctca gtataatcct agaatgacac ttgattattc taagtgctat tgtagttgca 55440 

aatcatcgtg tggtttgtaa ctgtcagtct gacctttact agacatatac tggaaaatat 55500 

tcttgcctgt gacttctctc tattgctaaa taatgatcta gacagataca cagtgaatac 55560 

agaaagttca gttgtataga ccaactgaca gacattgtga ttttaccctt tgttttttct 55620 

aagtgtgccg aggaagcagg ttgttgtatt gaaataaagg catgcaaata acctgctact 55680 

ggctccctcc aagatctcag gcttgctgta aaagccgtag ctaggtcaaa aggggttgca 55740 

ccttttgtga ctggcagcat aataaacatt ccccagttta ttctgctcat tattccatcc 55800 

cacttgtagc caatttcctg tgtggtctcc aaagcatgaa atctgcaatc agacatgtcc 55860 

tgagtgtcaa tgcattaggg aaataaaata aggaaaaaaa gacaacagcc gtcagttgga 55920 

gtctgtgaag gagctgagct ggttcataga actgtgttga gcagcaggag cgttccttgc 55980 

cccaaacgag cctgtccaag gggtgggagg aaggaagctg tttttctttt ccactcagca 56040 

gcgtgtagca acaacgccct gaaggagtgg cagggtgagc cgagacctgg ggctgagagc 56100 

agacaggatg acaggagtgt taccaggtgg ctgcatccct cctgcacacc gcatggccag 56160 

gtggtggcac atggggatgg ctgctgcgtt ctggtgccca gcaaggcctt cgcttgtgtt 56220 

tccccttggg ctgttgaaga cctgaaaatg ttggtgacct ggaaaaggaa gatgaaagct 56280 

cctctgttgc tttgtcagaa gactgtggct tgtccttgtt atgaaacttt ggactgcaat 56340 

aatgacggtg tttcactttg cacatccctt gttcctatgt tgttttgcct ctgtcttttt 56400 

tcagagacag catcatgaca ggagatgtcc aactcacagg gagctttaaa acagctttga 56460 

ttttattatt attattatta ttattattat ctagtaaaga aaaatcctgc tcttcattct 56520 

gctatctttt taacttgatt aaaaaacagg ttgcaataga tgtgtgttga aaattcttgg 56580 
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aggtcaaacc aagcaagact aattcggtga caggtaaatg caggagatgc acaagctgat 56640 

gcagttagtt agatgtcatt cagtcagact gaggaagatg aactgggaag ccaagctcca 56700 

gtgcttgtcc cttgcaaacc tcaggtacct aaggctcagc atcttctgct tttcaagtca 56760 

gcatcttttg ctttactgct tcttcctggt tgggccataa gtaagatcca ggtaagtgac 56820 

aggcactccc ataaatacta atgtgaagat acatataaca tatacataaa atgactttag 56880 

gatgttattt gtctctatat gtgacaccta atattatttc aacattatct ccaactgtaa 56940 

attaacccca aatatccatt ctctggggaa gcaggactgc cctgtgagcc actcagttac 57000 

aggagcccgt gtggactcct ttctttgaag catggcaatt ggttatttct ctcatggcac 57060 

tatgatcaga aatcagtcat ttctcagcag cagctgggtc aggcagaaga gttctgtaag 57120 

cctacaaaat gccagacctc agctatcaaa gagaagttac agagtggagc gcagtaatgt 57180 

tcatgcacct tcagctttgt aatgtagagt tctttcacct ttccctgctc agcttttaga 57240 

agtaaccagt gcaccccctt gtatttgtat ttccagccta catggtgatg gagctggtga 57300 

tggtgatagg aggtgaaggt ggtgacctgt gctatctctg tcaaggcagt tgagttgtca 57360 

gccactcccc accagtgtgc atccagaagc tcctgaacca cttcacagag ggccgtcttt 57420 

agtagacttg ggcctgaagg aaagtttgtt cctttggctc tagattacct gcagacatct 57480 

gagaagagtc tggcagccaa cattattgca gatttttatc aaaatttctc tcatagtgta 57540 

aggcataatt agatatgtga caataattgc ccagatgaca catttttgcc agctgttctc 57600 

gatgttcttt gaattgctta aacatatcca caccttaatg tttgttatga attcacttat 57660 

ggtaaagcat actaatttta tgttcactgt ctggtaaata aatgaagttg aaccctactt 57720 

catcacagca gaaaggaaaa taagatttcc tttgggtaat aatgagtctt ttggaagcac 57780 

atattgctgg gatctgatca ggaagcagtg gtgtgattta cactcaaaac tgtcattcca 57840 

gaagtagata actcctcagt tgccactgcc accaaggagg tatcacatca gggaagacac 57900 

gtactttggt ttcctagccc tttcatttgc cagcatagca atcttgaaag caagctcaat 57960 

aacgtttact ttttttgtag cacatcagta gtctaagggc atatagggct gcctggggtg 58020 

ggtgaggtgt ggacagcagc tgtgtttgtg ggtggcagtt tcctcatgaa ggttgacttg 58080 

gactccagga agtccttgtt agtgtggtgg gcagctagac ttcgtgatgg gagaagcagt 58140 

gtggagcaga gtaggccacc ttttccttgc taaggactac atacattttt caaatcaaat 58200 

aagactgtca ggctgcacca ccacctttcc ctctcccttt cttcaacccc ctccgcattg 58260 

gattcatcag taatcatgtg gccaatgagg cctgaaccca gtggttccca ttctggagat 58320 

gctggaaagg agctgtggct ttttccaccc tcccagctga aatcatctgc ttgagagcaa 58380 

cacggtgagc agacatttct ctttgagcac gagccccact ctggccttgg gcttttgagg 58440 

caaccaagta caccacacaa ccctcacaaa gacaacactg agacactatc ttaacgcaga 58500 

cacagatgca aagtactggg ttgccatcac acgttttggt tgccttccct gtgtgtcttt 58560 

ttcacaatta caatttttta cgaatcacag caattccaat gtaccttgga gcactggttt 58620 
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tgctcagaag ctccttcagt tgctccttca ctcctgcacc ctctctttca gagtctttgc 58680 

tccctcctgt ctggtgccta ttgtagggca cttgctgcag gcaacgtgcc gaggctgctg 58740 

tttcctgctg gagcatgttg ttgtgttcag tagctaggca ggcaaagaca actcaggtgt 58800 

gactcatact gctcctccag tttaggagtt gcttgaagaa tgattaaggg aaagaaaaaa 58860 

aaaaaaaaag aagaaaaaaa aagcgagagt gtgtgcgtaa ttagggctgg gaaagtggga 58920 

gtggtgtcaa tgagcaggca agctaacaga ctcttctgct cagcaaggtt tcgccacagg 58980 

tctctagttc ttcgtgttgg acccatttca ttgattcatt taccaggatt atacccttgc 59040 

aaagtaagta caacattttt tcttcctctg gtactttgat gcctaactaa cagttataaa 59100 

atctctgccg tgaaacgtaa tcgcatagag tgagcaggat aaggaagctt gtacaaaaag 59160 

acagcatgtg aaactaaagg tggaaaaaag cctgctcaga ctacaacctt ctggttttgt 59220 

cccagcattg ctgtgtttgt ccgtctacat tccttacccc tgaaaagatt cgttgcaaac 59280 

ggtgagtggg gtctgtcagg tgagactgcg aagacattag gaattattag aatatttaga 59340 

ctacctgttg gctgttccca atggcttttc ctgaatcaga agagcaggct gtatatgatg 59400 

ccatgaaaat gctccatatc tctaagtagg tgtagactgt ttgagaagtt tacaaaagaa 59460 

tattcttctt gttttccaca tgggttcgta tgctttgctc tgtttgcttt gtctaagcct 59520 

tgctagttca aaggacaaga acttaagtct actaattacc tacttgatct tcagtgtgct 59580 

catccggttg gaaaaattca ctgactcttg aggcacaata aagggtattg tggagactct 59640 

ctaattcctg gtgtgacttt ctcaattgtg ttgctgatgg tgctttttcc acaacctgat 59700 

gaacactctg atctcgctaa agcaaagcat cagtctgata ttgtgtgttc ctcagagaaa 59760 

catctgttca gaggaaatca tgtcttagtc acggagctgt gtaacctgcc tggtggagag 59820 

ctgccatttg tgtagaagta ggaggaagag gctcacaaga gttttgttcc tttatatttt 59880 

gtgttatcca agcaagagct ccagtaaggt catgttaaat gagctagttt ggagggggaa 59940 

tgccccacat gtgggttctt tcatatcgtt tatctaaact gaagtgactg cagggtgttc 60000 

actcacagct catgagctgt agccctagtg cacagcccca atagcagccc agcttggatg 60060 

gccaccgccc ggtctgcccc gggtgcgcac tcctcagggc tctttaacaa aggcaagaat 60120 

aaaataaata cttgctctgc tttatcagat gatgcttacc attcagctga cgtgacttgt 60180 

caggtttcca cacagatgtt gccgttctcc tgattaatgt tcagaagata aactacattt 60240 

agcttttctc ttagtaagca aatagcaaac aaagctttgt ttctgttggt tgcattcagg 60300 

agtgacaaag caaaaatagt gtcctatact actaaacacc tttaagttat tttttttctg 60360 

cactgattct agagcctctc agcttcctcc tgtatctgaa cgtgtttctt gaactctgtg 60420 

gccccatcac agctttaagc aaagctgggt ggatcacagg ctgcatgtgc ttagaaggtg 60480 

ccaccgtgcc gcgggcctct cagaatgctg acttgttgct ctcctgggaa agcagggatt 60540 

cagccagaat ccaagcagcc cttcttgaaa tttcatttcc aattttgttg actcctccct 60600 

gtgtgagagt ttcctgtgat tactgactca ggagctgtgt ctggtttctg ggactgctcg 60660 
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tgggcacctc atgggctttc gtcttgagtg ggggcctcag cccttctcac tcagccagaa 60720 

cttgctgcag tggggtcact gacacagctt ggggtgctca gggctttaaa gaggttcaag 60780 

acttcgtaat atttcatgca gtaaattcct ttcaagcatg tgaacgctgt gagctcctat 60840 

gtgttgtatg tcattaatga atgcagcatt aaaaaagaag gctgatcaga tgcagttaaa 60900 

aaagatggtg agatagagat tattctttgc tatccagccc ttattgaaac agcagggtga 60960 

aactgagggt gtttttttcc caacaaaatc ctctgaatgt gcaatatatc agtagcagca 61020 

ctaaaagaaa gaaagtgata agccttgcca ctaccaggaa tagattctct tggcataaca 61080 

aaggcattga gaagcatcat cagctactga gtgaacagga ggactgtaaa aggttcacca 61140 

cgaagtacct ccaggtttcc tcactgaaga ggaacacaga aaccttgcaa aaacgatcca 61200 

gcttgaatgg taccagaaaa gaatttctac gtcctggtgc agaattccac tggtgtaagg 61260 

aagaagagag tcatttaagt ttgcaaaatt tcacaattta tttccttgct ctgaatattt 61320 

tgccacccag gagagtgaag cacaggtagc acatgcacat tttaatatca ctgtaggtca 61380 

tttgccaata cgactgaaaa tgctgatgtt agaaaggcag gattgcattt ctggcatgaa 61440 

gacagaaagg aacgtgaaat gttttgaagt tattatgatt gcatatattt tcttaggcgg 61500 

taaggaagat ttggaagtca aaatagcatc agggcagccc taactgaaga aggatatttt 61560 

actccgctag caaatgaaat atttttcagg tagactgcac acatcattct ggcattgtga 61620 

gattatgcgt gttgtttatc ttcacgagag tggtagatgt tgaatgacac attcttggtt 61680 

ccttgggtaa ttttccacgg tctccccagt gagaaatgcc tgggaagttg gtacttgccc 61740 

atttcttcca tttttacttc agacagagaa agtatgcata tggattgtgt gctcgtgggc 61800 

cttaaagtgc ccttaaagag aatgagttca aagggaaaaa taaggtaggc atcctgttca 61860 

gagcagtttg tgtaaggtgc acagaagtgc gtgtctgtgt tgagcgagtg cagaaaggca 61920 

ttttaaagga tgatttcaca tgtgctcctt tgacctgttg ttccaagtga ctccctcagc 61980 

agcagtccca ggtcttctta tttgttttca ctgtcttttg ccaccatttt gcccaaagct 62040 

ccctcctcct ttgatgtatg cggagtccat cgtttctagc aagcttgact tttctggtta 62100 

ttagttgctt ttatatgtga gaagttgtga ccacaggagt gacacaggaa tgatgcttgt 62160 

agtgctgact ggcactgagt tctcactttt acacccagaa aaactctgag aacacttccc 62220 

aaacctcact ctgacaccag cttgattcct gctgacactg taaaatggga tctcccaggg 62280 

taagcttcgt taccaagcat cttgggacac tgccagtgtc aagggagatg gacagaccca 62340 

ttctgcttga aaagcatctt acagggatcc tttacatgtt gtaaacatcc ttcttttcat 62400 

ttttatttgg ggataacttt ctctggtgct gtatatttaa ttttttttcc tcctcaagat 62460 

gaattgcttt ctttgcgttc ggaggcaatt aggaaatact ttgttgctga taccaacagt 62520 

cagagcactg tgtgagggca cactgctggg taagtgtgtt tttcaaattt ggatttaaaa 62580 

agtcttgatt ttatgccatt atcctttttt cacttaatta gattgtgcat tatatttcag 62640 

taaccttttg tacagcgtct tttagctaaa attaagccag gtgccttact aaatatatag 62700 
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aacatatacc tatgtaagtt aatgaaaaca aagacgtgaa ggccttttct aatcaaacag 62760 

attttacatg gaaatcaaag ttttctcagc tgtgttgcag aaaaaaaata cccccctgtt 62820 

ctgttactcc tataaaaacg tgtgaatacc acagattatt ttggaaatct ctactctcaa 62880 

ctaccaaaac tgccacagca tctcgataca ttgatgtctg atgttcagcc aagtttggac 62940 

agtatgacac atgctcttga atgcagattt ttgtcattca aaacaccatt ccaaacaggg 63000 

atgagagtga gcggtcagaa gcaggtgtcc ttgctctgga gacagttccc tgcccacatg 63060 

tcccctcttc cctttcctgt cttctcttac ctaactgctg tcatctggtg agatctttac 63120 

tcatctgatg caacctagaa tgcaaaaggt atgaactagg taaatgttta agactgcagt 63180 

attaagtagg catttgagag aaatctctgt ccttaaggtg cttcttggaa gatcagcaaa 63240 

cctctcaccg aggtaatgct tcagataatg ctacagactt tcctgtttgc gtcttctgtg 63300 

tcagagcctg aaacgttatt gcaaatagat gtctggataa gaacagaact gttaaaatca 63360 

ccttgccatg ccatataagt tccaatattt tgccattttt tttcctgggc agggaacatg 63420 

ttgaagaaag tttttgagtt ctgttggaag tctttccctt ttgaagtccc ttgcagtatt 63480 

catcttttcc ttttccttct gtctctttca atagacagag ctgctgagca ccaatttatc 63540 

agattgtctt tccccttctt tagggacatg tgattctggg gatagaagac agtcaaactc 63600 

actgtgccaa aggagttacc gtcttccata tttgtgctgc tcttaagctc gatgcgatat 63660 

tgactgaaat tctgtggttt ccctttgttg tctttaatct acaccaatgg agttacaccg 63720 

aagtgcagtt ttagatctat gaaagcagtc tggaagatcg aatattccgt gtcattccca 63780 

gaacgtggtc cagaacatct gtcgcttggc accacctttt ccattcctga ctgcatagat 63840 

cagctaacag ccctacggca attgcagtta ctctgaactg ctaggaaaat atttgcagtc 63900 

atcattgtaa gtgatgagtg ggcacatagc agtatttatg taggaggcta agtacttaga 63960 

gtttctagga tgatctcaac ctacaggacc ggacagcttt ctggagagtt ctagcaaggg 64020 

taaggagaac agggaatcac ctcttagaga gaggacatgc cacagctaaa gctttaatga 64080 

acaattagat gtgaagcaag agacaggaaa gatgattgtg agacttttaa aagcctatca 64140 

aagcactagg agagcccaaa gcataggcaa agtaccttat aagttggcac atctgaagag 64200 

tatcaattaa aaacatatta aatccatatg ttatccgatg tgattcaata tgtgtgggtc 64260 

accctgacca acccagattt ctccacgtat gtctggtaat actggctcta cgtagcacgc 64320 

agaactgcca gctgtcactt gaaggtaagg gcttctactg agccactcgc attaccttgg 64380 

ttgggcatgg atgagagact cctcaaaagc tgctggtggt gtctgagact gggcaggatt 64440 

ggtcaggcct ttctcgcctc ccagcgtagg ttcaagctgc ccagtcccca aactggtgtc 64500 

cagcctcctt cagcaaggaa atcagtgacc tgccagcctc actgcaacag gagctcactc 64560 

tgtgggtcat ctctatcctt ttctgtttca ggatgacgat ggatgctctg caactagcaa 64620 

acactgcctt tgctgttgat atgttcaaaa agctatgcga gaaggacaga acagccaata 64680 

ttgtgtttgc cccactgtgt acctccacat ctttggctct ggcatataaa gctacaaagg 64740 
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gtgacactgc 


agaccaaatg 


aaaaaggtga gctgtcggca tcctgctgtg tagctgcaaa 


64800 


attgtcagag 


gtggctttcc 


tatttattcc 


tcttaatgct 


gtataggact 


gctggttccc 


64860 


ttgtaagcca 


ggcagaaaac 


tgtccatcca 


aaattccaga atatttcccc 


actccatggc 


64920 


tccacacaac 


caaagaggct 


gaaaatcact 


agcataggga 


aaaaagcttt 


ctcaagcatt 


64980 


tacaaggtgg 


atggggacat ggcagagtcc tcagcagttg tattaaggcc ttgtctcctt 


65040 


tcagcaggaa 


tgctgattgt 


ggctgaaggt 


gactgctgaa 


gtcactgcat 


tttctggata 


65100 


attgtttagt 


gattattcag 


gactgcctaa 


gcttaacagg 


actggaaata 


attttgccat 


65160 


taccaagtaa 


ttttagcagt 


tctgtctgtg 


ccatttcccc 


tttctcctgc 


catacagcta 


65220 


agaggaagat 


aatgcagtag 


gaggcagctc 


agcttgagta 


gtagtttgcc 


ttgcaaatag 


65280 


ctctagatgc 


tcaagggttt 


tacagcacca 


cgaagcagca 


tcatggtgat 


ggtgcaatga 


65340 


gtttatcaag 


gttgctctgt 


ggcggtgaga 


ggctgcacga 


ctgcctctgt 


gagagccagg 


65400 


atttacacag 


cctcttttta 


ttccagtgcc 


cacagtctca 


gcagttacct 


agaggtgaat 


65460 


gagaagcaaa 


ttcagcatgc 


atttatatgc 


tgattatcac 


ctggctctca 


ggggcattcc 


65520 


atgtatttga 


atacattttt 


cttcgtttag 


cagttcctcc 


ttgtaccttt 


ggtttccctg 


65580 


acggcacatt 


gctggagcac 


agcctctggc 


gcctctgctc 


atcctacaga ttgcaatgag 


65640 


tctatttgca 


cagaaacaaa 


gtggtatatc 


cacaaaggcc 


tgctgggtgt 


tttcccaaat 


65700 


aggattattt 


taaaaaaata 


aaaataaaaa 


tgatttttag 


atcttatttc 


tagtttaaat 


65760 


gacaccccaa 


agcttccttg 


tcatttcaaa 


gttcaagcac 


tgtctttgca 


atggaagagc 


65820 


ttaaaacatt 


aacctgtgct 


taatttcact 


ttcacttgtg 


cctgcaattt 


gcattgaacc 


65880 


gtcccacaat 


aagtgaacat 


ccacatccac 


aaatagggtt 


ctgttacaca 


agtgcactta 


65940 


tgtttcacat 


ttctcaaggt 


aatttactgt 


gcctgtaaag 


acatggtgtg 


ttcagggaga 


66000 


aagagcagga 


gtgaggctga 


aagggaaaag 


gaggtcactg 


atgctggttg 


ggaaagatga 


66060 


gaagggttgg 


gcaggctgtt 


tttaatggaa 


catgcactct 


cagagacctt 


gcaacaggca 


66120 


ggcacctaaa 


agcagagagg 


tttaggtcat 


gctagaatat 


cctggaactg 


ggcatgtgat 


66180 


ttcccggagc 


tgggaggtgg 


gtcagcagcc 


ttacctctaa 


cttacgttct 


gtctgccaaa 


66240 


gctcacctgc 


ttatctgact 


gatttctact 


gaaataccac 


atgacatcat 


gtgtcaataa 


66300 


tcagaaaacc 


ttgccatatg 


gtaagcagtt 


tttaaagaag 


taacccactt 


ccagaaagga 


66360 


aactaactgg 


aacatttatt 


tatctggcct 


ctaaactcca 


gatttttgga 


caagaatgtg 


66420 


agtttgataa 


aagcatgact 


ccacgctgca 


gatatgtagt 


tcactaaatc 


actttgctag 


66480 


tatgaacagc 


tctatggaat 


tctttggact 


gctcacagga 


aggaaacaca 


tttggttaaa 


66540 


gttttgatag 


gatcaagttt 


ttagatttat 


gtggggatgt 


caaataaatt 


aatttttttt 


66600 


ttagtaataa 


ataagagtga 


gaagtcgtgt 


tgttagcttg 


aacacaaaaa 


agtcaaagct 


66660 


ctggtcacaa 


acaagcatta 


tttattgcca 


agctgtcagg 


cctggagcat 


gtccagagaa 


66720 


ggacaacaaa 


gctgtgaagg 


gtctggaaca 


cagatcttac 


aggagagcag 


ctgaggaaac 


66780 
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tgggattgtt cagtttggag aaggagaggc tcaggggaga ccttatccct ctctacaact 66840 

gcatgagagg aggctgtggt gagcttgggg ctgacctctt ctcccaggta gcattaatag 66900 

aatgagaggc cgtgtcctca agttgcacca gaggaggttc aggttggata tgaggaaatt 66960 

tttctttttc tgaaagagca gtgagatatt ggaacaggct acccagggag ctgttcaaga 67020 

actgtgtaca tgtggcactg tgggatatgg tttagcgggc acagtggtgg tgggttgaca 67080 

gttggactag atcatcccag aggtcatttc caaccttaat gatactatga tgctatgagt 67140 

ttttagataa taaaaagaaa ggtgctcagt attttatctt gttcattatc aggtgctcca 67200 

tttacaagac gtcaaagatg tttcttttgg gtttcaaacg gtaactgcag atgtttccaa 67260 

actcacctct ttctttgcac tgaaaatggt caagcggctc tttgtagaca agtcgctcag 67320 

ccctaccaca gtaagtactg cagaaaagtg cttgaattgc tcgaccaacc agacttcaat 67380 

gttattcaaa atacgttctc tcactattag cttttacttg actagactca gatgatgaac 67440 

agcataataa gagtttgtag gaggatgatt gttctgcttg accccaagca atgcagccac 67500 

tgctagagtt gcaattcttt cattaatatg ttttaggtca gtaggcgcag taggttttga 67560 

atgcaatatg acttctatgc cacatcaagg gctttgcaat ataagtatga ctgggaagga 67620 

ttttaaataa agatggtggt gcaagtgtgt ctagtccaca cacccaagta attactgcat 67680 

aaagagtagt tttcttaatc taactgagga ggcacaagcc tggttattca aacaacacaa 67740 

gtgaggaaag tgttgtttgg ccatgaaact taaggacctt gcaaacaact gagaaaaatg 67800 

ttgtgtttgt tttatcagag ttgcctttga atagggcccc aagcaagggc aacttcagcc 67860 

tagaagtgat gtttcagaag actcacagcc tgcttgaatg gtgttataat caggttgcct 67920 

gctttttggc cccatccaca gcagtgagca tctcacctga caaggatagg cacactgtga 67980 

gcagcctgtg gcctttgtct catccctttc ttttgcccag gtgtagactg aaggctactt 68040 

tatcctttca aactcaggca acatgttcac tcctgcagta cgaaaggtac tttagcagcc 68100 

agtataactg tattgaagac agtcttggga gcaatctgct gaatgcggct gcgtgtcctg 68160 

gctgtcacct gctgttactt attagctgtc ctttgtaata tactctctgc ctacaccgta 68220 

atgaagcttg ggatactggt tttgtaggcc gtgtggagag tcatctagtg aagaacatct 68280 

aaggaaggtt agctttggta ccttgtgtct ttcaggactt tgttaactcc acaaagaggc 68340 

cttttccatc agagctggaa ctagtggagt tcaaggaaaa aactgaggaa acacggcaga 68400 

agatcaacaa atctctctca gagctaactg atggtgagta gggcctaacc tcggggatgc 68460 

tgattacctc tttgaagaat gatgtctttg tcttcatgac atctcctaac tattgctttt 68520 

agaagtaaat atacagtgaa agcaaaggga ctgcacctat tatttggatt catgaggatt 68580 

agctgtgtta gcatgtttta aaatcattta ctttactact gtggcatttc tggaggcaga 68640 

ccttacatta gcctttggca aagcatctca tttgttttca ttgggaaagt ttggctcctg 68700 

gctgcagagc ttcacaaaca tctgacatca atacatcaaa tcctggcccc gttctctaat 68760 

ggagagtatg tgctgaactc tgaatttcag gctgttaatt agtagctcat ctcagcagca 68820 
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cagctgattt tgaccacagg tggacatgtg tttcttactt ggaaacactc ccgtggcaat 68880 

agttctgcag cacttttcct gcagtaccac tgagccacta agtcacaaga agtgcctctc 68940 

agtgaccatc aaggctccca ggcagaacct gcccagtctg tgcagggtag aggtctggta 69000 

cgcagtgccc aaggcagagc tcatgtacat gctgtccata ggtagctcca gggttggttg 69060 

ctgcctattg ccctcatgtg gtacacatat gaaaatatgg gtgcctgagt tacatctgct 69120 

ccatcccgag gtgacacagg tgcccacagg gaagtacttt gcgctgcctg tgtgatttgt 69180 

gcatgaatga agactaacat ccacaacact gtggattcag tgcctcatga cagtgtttga 69240 

acagacacaa aataaagcaa gggaaagaat tacgttctct ttttgaaatc catggcacta 69300 

tttggttatg aactgtaatt agatggttag cggcatttct tattcgggtt tattcttatg 69360 

tatcactcca aaagtgagta gaagctaaac tggaacttcc cttgaagtct cgctctccaa 69420 

atgagaaata tttttttcag ttctacctgc tgaatttcgc tgaagtttca gtaccttctt 69480 

taaagtacta aagaaaagca gtagacatat tttttattct gttttatgta aaccgagtaa 69540 

aaatgtcact tggaagatct gtcttgatcc caaattccat tttaaacatg gagctgcagc 69600 

taaggaacta aatgcttcta tttggggatt tccctttata attaaaactg ctatctgtga 69660 

ggtgcagggc agaaatattt taattcagta cagtgtttcc atgttctgtg aaaacagcac 69720 

atatgttgat aatttactgt attaatgacc agcttaacca tcttcacagg caaaatggag 69780 

aatattctga atgaggacag tgtaagtgac cagactcaga tcctcctagt taatgcagct 69840 

tattttgtca caaactggat gaagaagttc ccagaagcag agatcaagga atgtcctttt 69900 

aaagtcaaca aggtacgtcc tgaaataaaa tagagtacac cttctactca gatgaatgtt 69960 

tgccaatttt gtgctaagga aatttcagtg agagcaagtg aaaaatattt gttactacta 70020 

tggcattctt agactctctg tcaaaaccta tgtgctgttg caaaagtacc taagccagtt 70080 

ttcttgttac gttgctagtt tgaagctgtt ggtgaaacaa gcactaaagg tcaccgatag 70140 

taggtaattc tttcctttaa agcacatccc cagtatattg tattaagtac accttgtcac 70200 

atgaaaactg ctccccttaa agtaccaaca gctttcacta gcagtcttac agctgatatc 70260 

gttacttaca gaagccaaca aattccatga tggtaatcaa tgtaccactt tcatgcaagc 70320 

ttgcaaagtt tcctctctca tcttctctgt gaattaaaag gagtgctaga ttgtctcctc 70380 

ttgtgttttg cagactgaaa ctaagccagt gcaaatgatg aatctggaag ctactttttg 70440 

cctgggttat gtgaaagagt tgaatgttgc aatccttgaa cttccatgcc ttaacaaaca 70500 

tataagcatg ctcattctgc ttcccaaaga cattgaagat gaaacgactg gcctggaaaa 70560 

ggtgagagaa aaaaacagta ctgagatgat gctttccatg cacagctgtg tcggttagct 70620 

gtgggtagct tgggtaggga ctgtcttcct tgaattcctt cattgggttg ttgagctgat 70680 

tacatagcaa acgcttgtga agaaccagta atcagagtat gcacatttag tggagtttct 70740 

ctggaagtct actctatagg ttaaataatc attatatcaa tataactgag agtgtaagtt 70800 

aactctgaat gctacaagca aaagttgtct tttggacttt gtttttttgg ggtttgatag 70860 
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gactgatgag ttcagaaatg gtctttttgt tcccactttc tctggactgc acattaattt 70920 

cctttgttct ttatgtcctc agctggaaaa ggcactcacc cctgagacat tattacagtg 70980 

gaccaatccc agcatgatgg ccaacaccaa agtgaatgtg tttcttccaa agtttagtgt 71040 

ggaaggcgat tatgacctga agccactcct ggaaagcctc ggcatgacaa atgtctttaa 71100 

tgagagtgca tcagatttct ctgagatgtg tgaaaccaaa ggtgtggttt tgtcaaagat 71160 

cattcataaa gtctccttgg aagtaaatga acagggtgga gagtctctag aggtaccagg 71220 

atatcggatt ctgcaacaca aagatgaatt taaagctgac catccgttta tctttttgtt 71280 

taggcacaac aaaactcgca atgtgattct ttcaggcaga ttctgttccc cataagcaga 71340 

gaatattaat tatgaaaaag accataaatt tatggtgatg catgttcctg taaagcttgg 71400 

tgtcctgact atcacctttg aaaggaattc taagaggttc atatatcaac agggtaatac 71460 

aatgtactct acatatgcag cagaactagt ttatttcctt ttatttaatc cccttaagct 71520 

gaaggattcc cactgtgcag aacacatgat atttgactaa gaagtattcc atcctcatcc 71580 

acgagaatat tttgttcctc tgtgacatct ttttccaaaa caaaatgaac agagaacctg 71640 

tttttgaaag actaggagct ggaagaggct ctgggggaaa gagctgcatt cctgtttcat 71700 

atccaaaaca ccttcccttg agactcatac tcactgccta aagggggaaa atgtggacat 71760 

gtggtgtgat agccctcctc ttgtacttgg ctgtagtctg gtgatccagg gtgccctgct 71820 

ggaccctgct aatgcacggt gaaatagtgc agctgaacaa ctcagagttt gcatctgtga 71880 

aacagcagct gcaatatgga tgcaagaggc aataataaaa cacccagaag actcttcagt 71940 

gtgtgctacc tcagtttgta ggttggggag gttgcactct actgtgtggg attttttcac 72000 

tcattctcct tcagacatgg cagaggtgac cagttcactg cagctgagag gaactctgtt 72060 

gtatatatcc tgagaaaaag aaggctgtgc agttctagga tagaaatcac ttggattaat 72120 

attgaaaatg ccacactctt cagatacaga ttttctgtca cttctggatt cagcattaag 72180 

gaggctccac acgtctacct aacctctggg attcaagaaa gaaataaagg ctttgacttg 72240 

agtgagatta acactgtaat tagaagtctc caaattccat atgaaattat ggtaattagt 72300 

tttcctttct atctaaagac tgcctgctgc atatgttcag tactgattta cctaattact 72360 

gttcagaata aagcactaca aaacctgtgt caaatgtctg tagcacatcc agtgtcagtc 72420 

tgttctcctt cactcagctt ccaagagggg ataggaacag aaattggtag attcattcag 72480 

gacacagtta aaataaatat atgaagaaat taaatctgtg actgaattgc ccttttggac 72540 

cacacgataa tagctgacaa ttaaggagta tagtactatt tggtcaatat atagagtgag 72600 

ttcaattata tatcttcaaa gaaggggcca ttttaactga gtattcccct tggttcttca 72660 

gatctgaaag aacctagaga tttctaaatg ggaacataca acccttaata catattcctt 72720 

tcttctcata gcagagaaca gcacagtggc tattatggat ttggagagag ttctgttttg 72780 

attcttggcg ttcccaaccc acatgtagcc ttcagtcaca aacgttcagg gtttcaccag 72840 

ttgtcttccc tctgtgagtc tctaatgctc tctgacttat cttaatatca gacagttagt 72900 
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ggatacattg gttcacatct ttgaaggcac tgaggggact aactcaccaa aaccaacaag 72960 

agatgagtcc ctgcaggcat ctgggggtct ctgctgctct tcatcaggca ctctgataca 73020 

tagatagaag aatgctatgt gtaagactta atttcatact gtctgagagg agacagcctg 73080 

caaagacctt tactagctca atagcttcag tgataaatga gtgtctggaa aatgttttaa 73140 

tgtgtcccat cattgtctga tctgttttgg aggctgcggt tggagttatt cagagctgtc 73200 

atcactgcgg tgtcccctgg ttctcccatt ggtctgggca ttgcacgtgg ggttggcagc 73260 

cgatgagcag ctgggcagtc tgtacatagg caaggtggac tggttgccta cagtctgtgc 73320 

agtttctctg tgtttcagca ctgactcgtg gtctagtaat tgcaggtatc cagcaagcca 73380 

aatcacactt cattgctact gctgtgctgg tctgctagac tgatgaaaat ctctgttaga 73440 

ctgcctcatc tctctttttc tgcctggata agacttatta aaggagaaaa gctgatatat 73 500 

cactctcaga ttttctagat cacgaaaaca ttgcagtgca ggagccattc aatccagctc 73560 

atactccatt taaatgctga tcagaacact tagtgcatcc acgtacgttc ccagagagct 73620 

ccttgttggt gcctttgcca aggagggcta tcccatgaaa gcacacagga atgctgcctc 73680 

cggcagagac ctgtctgtct ccaccttctg attaacacaa aggtacccaa gactatgcaa 73740 

ggcaagctga tatttcagat aagaagttgc ctcagtttac agaaggccat gaaaagctct 73800 

ggttcatttt ccatatctgc ttcttctctc tgcttttgga aaataattct tatttcctac 73860 

tagcaagtcc agactagtgt aatatttagt cttgatttgt taaatcatcg tgaattttag 73920 

cttttactaa tggtatctta gatgatctaa tatactaaca cctagtaagt gacttcagat 73980 

aaggtgttag gcttatacca cagcccagtt tgaaggagct aagtgcagat ggcaaccaaa 74040 

caagcaacaa cacaaatagg cccaaatagc ccatggaggg ccagaaagta cagctgcagg 74100 

cagagctgtg atgtaatcca tttttgtgag ccccttgaag ccagcaggcc agcctcgtgc 74160 

ttttagtgta atggacatca tacagccaaa aaggaaggtt atgccataat ccttgctcca 74220 

ttcagcaatt ggactcgagt gtactagccc tgtttcctag acccttccag tcctatggaa 74280 

atttagaagt cccaatgtaa aacctattca acttgcagca tgtgagagga agatatcaga 74340 

acagctcccg ctaacgtgag aaatgccatt acagggtact gtactgttcc acccattctg 74400 

gcctctgggc tgtgggcaag aaactgcatg ggaggacatg ggaaaacctg tgtgggagct 74460 

tgcatataga aatgtatttg tactcaaagg cgttggctgt gacggagaaa gtgagcatag 74520 

gtgaggactt gctcagatca ccaagcaacc ccacctcacc tttcaacaca gcattactat 74580 

ccaatgatgg gtaactggcc tgatggaaaa aagcaagcca tggctggctt ttgcaggcag 74640 

ttgcaaactg cagtgttatg cagtctagcc atctcataac ttgcttagct gcatagtcat 74700 

tgcccacttc ctgctcccag cattttgtag aagagaactg gtacatctaa atgctttgca 74760 

gtagcaggaa ttggttttgg agatgagcag ctggttttga aacttgaaaa atgccacata 74820 

caggcaattg gcctggctgg aaaagcagtg cggttttgag ccttaggtat gttacatgtg 74880 

caagtgtaga gtcctacacc tggggaggaa tgactgcagg taccagtaca ggttaggggc 74940 
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tgagctgctg 


gtgaggagct 


ctgtggaaaa 


gaacctcggt 


gttctggcgg 


gcaacaggtt 


75000 


ggccatgagc 


cagcagtgta 


cctttgtggc 


ccagaaggcc 


aatggtatcc tggggtgcat 


75060 


taagaagaat 


gtggtcagca 


ggttgaggaa 


ggtgatcctc 


cctctctgtt 


ctggtgaagc 


75120 


cacatctgga 


gtactgtgtc 


catttctggg 


ctcctcagtt 


caaaaaagtc 


agggagctac 


75180 


tggagagaga 


gtccagcaga 


gggccacaaa 


gatgcctggg 


gtcctgtagc 


atctcccttg 


75240 


tgagaaaaga 


ctgagaaacc 


ttgggttttc 


cagactggag 


aagataaggc 


tgaaggggga 


75300 


tcatatcagt 


gattacaaat 


acttaaaggg 


cagaagccaa 


gtgaataggg 


ccaggctcct 


75360 


tttggtatcc 


tgtgacagga 


aatgggcgaa 


aattcaacac 


caacaagagg 


aagtacttct 


75420 


ctactttgag 


ggtaacagag 


gactggaaca 


ggctgcccgg 


agaggttgtg 


gagtctcctt 


75480 


ctctggaaat 


attcaaaacc 


tgcctggatg 


ctttcctgtg 


caacctactc 


tagggagctt 


75540 


ctgtaggagt 


gagttgaact 


cagtaatctc 


cagaggtccc 


ttccaacctc 


tacaattcta 


75600 


tgattccatc 


ctaacggcct 


tagaagggtc 


agaatttgca 


catacggtat 


aatgttctga 


75660 


ggcaaagcag 


tgaaacagat 


tgcaaagaag 


ctctaaggag 


atatagagag 


caaatcaaag 


75720 


aaatgagtgg 


aggaacgacg 


tcagtgtaaa 


agggagggag 


aaagcacaga 


gtttggaaga 


75780 


acgaaagcag 


tgaaaggtat 


tcaaaaatgg 


cactgaaaag 


tggctaggac 


tcacaaaagc 


75840 


agcagaagaa 


aaaatggaga 


atggaatggg 


aaaggcaata 


ggcagaaaga 


aagaaaaaga 


75900 


taaagaggca 


ggaacaaatg 


actaagaagt 


ctgaagaaat 


atgcagaaag 


gaaaagcaaa 


75960 


caacaaagcg 


aatccaaatg 


gaacagaaaa 


aaagtgaaaa 


gaaggaaaat 


attactgagg 


76020 


agatctgata 


ctgtgtgcag 


atttgtgctt 


cccatctctt 


tctttgcatc 


tcactgcatg 


76080 


ttggggacat 


aaccccatgt 


gatcagcctc 


tcttggactc 


ctattcttag 


ctccagtgat 


76140 


ttaaaagaat 


accccctggg 


tggggattcc 


tgttacagat 


caaggaaata 


cttcttcctc 


76200 


tctctaatta 


aaatccttcc 


actcacagaa 


agctggctct 


gtacctgaat 


gcgtgcctac 


76260 


tacctgtagg 


caacataaag 


ctcatgcatt 


tcctattcat 


ttgctctatt 


tctgcaagca 


76320 


agctcagccc 


caaacaaggg 


atctctaaat 


cctagcaaga 


accctgcaca 


ccccagtgtt 


76380 


caagtcctga 


caccaccaaa 


ttcaaaagga 


actacacaca 


gcaccacagc 


catggatctc 


76440 


cagtgttaat 


gctgttctcc 


agctaagggc 


gacttggctt 


tgcagtcagg 


agatgttgcc 


76500 


aggatgcctc 


ctgtcaaact 


agttgggcag 


ctttgagcga 


aatgctgtta 


gcacattgct 


76560 


agatataggt 


ttcctggtct 


tctgcaggaa 


actgaaggat 


gacatttgca 


tgaaattaca 


76620 


acgtgcagcc 


tttatcaaca 


attggctaga 


gactgaattt 


tcccacaaga 


aagtggaaga 


76680 


aatttaaaat 


agagtataca 


caggaaggtg 


ctccagagct 


cagctgttgt 


gttcttcatt 


76740 


tgacctcctt 


gctcaagaag 


gtaaacatta 


tttttctctt 


caaaaataac 


ttgtcttgtt 


76800 


gttgttgttg 


ttttggccaa 


atcagtctaa 


aagttggtaa 


atttcatgtt 


tatagatggg 


76860 


gcaaaggggg 


aagtactttc 


acaggctgga 


gagagcaaaa 


gacactgcta 


aaatttgggt 


76920 


ggtcttcacg 


aagggaggtg 


gtcttctctg 


gggtgaggct 


gggaatttag 


gaacacatgc 


76980 
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ccaaagctat 


gaatctaaag 


atgcctgtct 


aaattcctca 


gacttttgac tgaaatttcc 


77040 


ctcggttctc 


cctgcctgct 


tggagagcta 


taactgccac 


agactgagtg 


gtttatacca 


77100 


catgcagatg 


ctttgctgcc 


tacatctcca 


gaagggtcaa 


agggctgttt 


tagaacagcc 


77160 


caactcactc 


taaaaaaatg 


ggctttatga 


gaagcgatgg 


tgcagatcat 


ctggataaac 


77220 


tcacccataa 


attaatagaa 


acaggttaat 


tttccttctt 


tactcaggtt 


tccacagcac 


77280 


aaggaaaagc 


cttgaaatgt 


tcactagaca 


agagagggca 


cgcaactctt 


tggttgcgtg 


77340 


cttgggtgtt 


tcctctgtac 


cctggtctct 


gctgctagga 


ttgtttatgt 


tcttaaacaa 


77400 


tggctgtata 


taataggaag 


ggtggagtat 


tcttcagatt 


ttgttttggt 


aatggggatg 


77460 


cttcactatc 


aacatatttg 


ctcctggctt 


tggcagcggt 


gttcaaaaat 


ttgctgagaa 


77520 


gtttatgtaa 


ctacacattg 


gcataacaaa 


tagcttgcca 


ctgtatggcc 


aatgtacatc 


77580 


cattcctgtt 


caagcaggaa 


taatcagcct 


agaaagaagc 


aggaaagata 


catcctggag 


77640 


gtaccgatgc 


aaaataatag 


aaccagtcag 


gaaaagccct 


ttcctgtata 


aaaacagcat 


77700 


catgaggggc 


taagttgctt 


gggagatgga 


cttggcaaca 


cttctcctga 


aaagatattt 


77760 


tgtgctgaaa 


tgtattgggt 


tttaatttaa 


agcacattgc 


tttggaaatg 


ctttgtgttg 


77820 


catggggaag 


actcgaattt 


ctgcgttaaa 


ggaaatttca 


tttttcttat 


gtgttgtgtc 


77880 


cctiztaaacc 


caaaaagcca 


cagaaacact 


ttgaaagttt 


ttgtatgaat 


ggtcatgaaa 


77940 


aataacttct 


acaaccatag 


gcttttcatg 


tgaggacact 


gtattatctg 


ttgtgttctc 


78000 


cttttctagg 


atagacacgt 


atcatttccg 


ccaattctct 


cttcctttgc 


ttatgagaaa 


78060 


taaatgtata 


ttaaagcact 


taaatgagaa 


gaagagtaag 


tatgcaattg 


gaattatcat 


78120 


gcagcatcag 


ggaaaacagg 


tttcttcttg 


ctttcccttt 


ctacatatag 


aactgcctta 


78180 


caaaccaggc 


taccactctt 


tcagaatctg 


cattttattt 


actgcctcct 


cctgttgact 


78240 


gcataatgta 


acataccaca 


tcttttaatt 


atgatagctt 


tgagcgcagc 


ttttcattct 


78300 


tcagtaagtt 


ttgccttgat 


ttcatcttta 


gcctaaaaca 


agctctacag 


agagaacaga 


78360 


gcgtgaacag 


ctatagaaaa 


ggagtatttt 


tcacttcacg 


gagccatgga 


agcaatttgt 


78420 


tatccttaca 


agacttctgg 


tatacagtgg 


tatctacgga 


aggaggctct 


tttcctgggt 


78480 


agatccctgc 


tcacatataa 


caacgccagc 


aattccacct 


cccagactgt 


taacagctac 


78540 


tgagccatca 


tgcaaagcat 


ctcccttcat 


cagccataaa 


accacagccc 


tgcttgctgc 


78600 


ctctgcacaa 


ttgctaatgt 


tctgtgcaaa 


cagtttgtct 


ggtgcagaac 


aacagctgtg 


78660 


atctttctgg 


aacacttctt 


ttgatcttgt 


attttctctc 


cttcccactc 


caagatcttt 


78720 


taaaagaacc 


atttccattt 


gttgcccaac 


atctaggtgt 


tctcaaagtt 


actctgccct 


78780 


cacggtggct 


ccaaaactca 


ccaacaaatg 


attacagaga 


tcataagcat 


ggcttaatga 


78840 


tgtggaatca 


tacctacaca 


tactccctct 


ccaaatatcc 


attaagaaag 


ttcactaaat 


78900 


cctttggttc 


tctagtaaga 


aagttccttc 


tccagcccac 


atcccttctc 


cctccactgt 


78960 


tgcattgctt 


ttctggggca 


gccctgtaaa 


tagctcacat 


gaagccatgg 


aattggtggc 


79020 
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actctgaaga cagtctgetg ctttttctaa 


a yyLd LyydL 


79080 


a r~ -\~ t — I - n +~ "Zi e~ 
dLC LL ig tdL 


yL-L.dydL.yL_ L 


tgcctttaag 


acctgtttcc 


agctctcatg 


f~1* r 1" r r r +■ r +■ 
llllllllll 


7Q1 40 


9 Lye Lxyyi-y 


y l Ly y l ll ll 


tccctgtggg 


ttggggtgga 


ggtgcctctc 


l ll. Ly l Lydy 


79700 

/ zf c. \J\J 


yaay L LLa L L 


agctcctgtt 


gtctcctcga 


cgccttctga 


ggtctagaca 


v— LaLauL 




d LyLQ LLL Ly 


acctacattc 


acagtaaaca 


acctcttaga 


tccattttag 


atcttttarr 

«— I- L L LCILL 


79320 


d y v_ i_ y luq ci o. 


gtggagcaac 


acaaacttta 


acatgaaaga 


agtgctgagt 


"tttalztttca 


79380 




aataatagct aacgagggtg 


gaagaaaaga 


gaaatgatta 


zj l. y l y 


79440 


LLLLLLLLyL 


ggtaggatga 


ctgcccattt 


atgttaggcc 


ttcatatgaa 


y \x v» l d. V— y y 


79500 


l L^ayy y l 


gaaacaagtg 


tcttagaatg 


aaacatatat 


gaacttttta 


L L L V_ddy L L d 


79560 


yy i-ddddyyd 


aataaatgee 


tgcacttgcc 


acatatcagc 


accttcatat 


ciVTc a fir aar 


79520 


L Ly Qv. L L L 


tgtcaatcta 


tcttaggcta 


agectttttt 


cttgtgggct 


y c-v y i— i_ v. ci. i_ i_ v. 


79680 


LLQ L l y LL Ly 


ggacttgctg 


caagctaagc 


tgctcgcaca 


gaeaacttge 


l y v_ d v_ v_ LL.ay 


79740 


LdyayLLd Let 


gcaacttctt 


acaccctgtt 


aactttggtg 


cctgagcccc 


LaL L Ly LLa L 


79800 


dLddciyd LLL 


tgcctgtctc 


acacctgaat 


gagaggcagt 


gtgtgttccg 


ratrrf tnra 

L d LLL L Ly LCI 


79860 

/ ZJ iS> \J\J 


yLC.ciyxyL.ciy 


gaegctgagt 


agttcttgtc 


ccagagcagg 


ctgaaagcta 


npnrrarrrf 

ydyLLdLLL L 


79Q70 

/ ZJ ZJ £~\J 


yciL.L. Lydy Ly 


ctttctctcc 


acactgtgct 


atatattttc 


ccctaaataa 


d d La LL LLLL 


7QQ80 

/ ZJ ZJ 0\J 


LyydaLdLdy 


gecacagtta 


cttatgtctg 


caagcagcca 


agagcatatg 


rl"l~l"ort*l"l"1" 

LLL L y L LLLL 


R0040 

L> \J\J*-\\J 


L L LaLa L d i_ L 


tctggtgtgc tgtccagaac 


atcctttgtt 


tgacactaaa 


at*1"ria1"Ot* nt* 

CL L l y cx l y L y L 


80100 

U v i U V 


n r 1" 1" t"t*1" a 1* a 

yL L L L L Lu Ly 


gtacaatatt 


ttgagaaaaa 


cttgagtact 


ccactgctat 


V_. \w V*. v_ n. v_,u u, \ — tv 


80160 


yL L L Latay l 


tatttcccta aaggactgat 


aagggcttct 


taaaagcett 


LLLLLLLLLL 


80220 


L LLciyaLyyL 


attcttcatg 


aaaagaccaa 


gctgaaactt 


agtcccaaat 


1~ rttrttarr 

LL. L LL L LCIL-L 


80280 


dyay Lyyd l l 


taatggccca taggaaaggc 


ateagactge 


tgtatttaca 


ni~a raanana 
y LdLddydyd 


80340 


aaayaa Lyay 


acagatcttg 


tcctgccatt 


gaacaggaag 


cttacagact 


t" 1" c t~ n n o n c t* 

l ll LyyyyL l 


80400 


yL. l y d y l Ld l 


tgcttcgttg tgaaattgee 


attegttate 


cattctgaat 


Lay l y y L LLL 


80460 


Ld LLQaa. LLu 


atgaggagac atgaagtata 


ctgcaaacag 


tgcatgtttc 


c at" aa a *t aa "t 

' — n. Lay y my t_ 


80520 


ayua L LLQ Ld 


getgettacg 


ttccttcttc 


atacatgaaa 


ataattacta 


y ma lll mL 


80580 

UL/ JUV/ 


L L LLaLyadL 


ctgttgtttg 


aatccttcac 


actgcagctc 


aggttaccag 


atfitnnttan 


80640 


axgeccy xyx 


agtttctgtc 


accccaatct 


gtctctaatc 


atgttgttac 


ddydyydddy 


80700 

0\J / L/L/ 


ddL LydLyLy 


atgacacaca 


ttaaactagt 


ttgtagaagg 


aaatccaegg 


L Ly uL L y d L L 


80760 


Lddd LdLLclL 


aaccttttgc 


ttacaaataa 


gaacaagaca 


gacagaccac 


y y y dd d L ll L 


80870 


tttggaaggg 


atcagataca 


ttgtgggata 


agatggaaaa 


acaattctct 


ctaaggaatt 


80880 


ctcatatggt 


atgagtattg 


gggccccttt 


ccagatcctg 


ctgtattcac 


atgagtgtga 


80940 


attaatagat 


gtgtgcaaaa tcagctattt 


caaactcaga 


attcagcaca 


cttctactat 


81000 


ttagcaaccg 


actatgggat 


gattttaggg 


eggacagata 


cttcacagta 


tgatacagat 


81060 
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aagcaatcag 


ctgattcaca 


tttctccttt 


ccctttttgc 


tcccagtaag 


ctgcaggctt 


81120 


cacaatgggc 


tccatttcta 


gaatgattat 


tgagttttgc 


cttgatctct 


acaataaact 


81180 


caacagaaca 


gcaaaaggcc 


aaaacattgt 


cttctctcca 


atgagcatct 


ctacctccct 


81240 


tggcctgatc 


cttctagggg 


cacgaaacaa 


cactgctgct 


cagatagaag 


aagtaagtac 


81300 


tgctgaaatg 


ttctgagata 


cttccacata gcctgctgtt 


cccccagtgg 


caatgctggg 


81360 


ctttgcagca 


caacatgtgt 


gcttaggaga 


caaagataaa 


cacaagctca 


actgctgcct 


81420 


tgagagcagt 


gcttggtgtg 


ctgtgatccc 


tgctcactta 


tcaactgtga 


cattcaaacg 


81480 


attcaacatg 


tctcacctac 


agagcacacg 


gagcctgggg 


gtacagggtg 


ggcatgcaga 


81540 


agtctgttcc 


tctggtcacc 


atgcctttta 


ctccctgcag 


tgcaagctgt 


atgctctgag 


81600 


atcttttatt 


tcttttctta 


tttgtttctg 


agagcagtaa 


gtgaccaata 


ctcctaaggt 


81660 


atatgtggca 


taaggcagta 


gctggctctg 


gctgtgtcct 


ggtggatctt 


catccattgt 


81720 


attataatat 


tgccacaggt 


cagctgctgc 


caagggaaac 


tcattctcct 


tatgaggttc 


81780 


tgagtgactc 


ttgcttagtt 


taggaaagca 


atggagatcg 


agtactctca 


acaaggggga 


81840 


atggcgtcta 


actaaaagag 


cggaaattta 


ggtaagatgt 


taggcatata 


ttctttacac 


81900 


agagggcaat 


gaggcaccag 


cacaggcttc 


ccagagaagc tgtggtgcgc 


catccctgga 


81960 


ggcgctcaaa 


gccaggttgg 


atggggccct 


gggcaacctg 


acctggtggt 


ggcatccctg 


82020 


cccacagcat 


ggggttgggg 


ctgagtgggc 


tttgaggtcc 


cttccaaccc 


aaacctttct 


82080 


atgacagtta 


ataaatctac 


atcacttatc 


caggacagcc 


cagtaaatct 


ttcaaacaag 


82140 


gaaaatgcct 


ttatcccagt 


taaaattgcc 


attaatttga 


cctcttcaac 


tgcaggttct 


82200 


ccacgtcagc 


aatgccgcag 


gaactacaag 


ccttgaatct 


gagcttgaag 


gtgcagtgcc 


82260 


cgaaaacaag 


tctgaactaa 


gccaggaaag 


agagtcttcc 


ccctctctgg 


tatgtctttt 


82320 


ttagtacaag 


agtctttcac 


tccacagtag 


cctattagtt 


gtaaagcacc 


acagcctgcc 


82380 


acaggaggga 


gtcaagatcc 


catgcacaac 


gtctgcctgg 


tctactacgc 


ctgattgaag 


82440 


gtgttccctt 


gtaatcagcc 


aagtcctcca 


taaagtcaaa 


tacaaagccc 


ccaccagaag 


82500 


gaagatcagg 


ttacaaaact 


tagattagct 


gaatttaaat 


ataattacag 


tgggagctag 


82560 


ccctacactg 


caatctaatg 


aggatgcaaa 


tgaacaacca 


aagctatact 


gaggaatact 


82620 


tgtaattggt 


gtgtttgaaa 


tattcctagt 


gcaacacaga 


tgggaatctt 


aaccacgaag 


82680 


cgttccatgc 


actgctttta 


caactacaaa 


accttggcaa 


agactatgtt 


ttaagcctgg 


82740 


ctaacagcct 


ctttatccaa 


caaggatttg 


aaccgcatca 


ggtaagataa 


ctgtaccttg 


82800 


taacctctgt 


ggcgctgacc 


cccagctttc 


tggcaaccat 


atgcttcact 


gttgtccctc 


82860 


catgtgtatt 


tttgagcatt 


ggaggtgctt 


cttggagcca 


tatctcttag 


ggttgttggg 


82920 


aaagagacag 


aagtatcagc 


tttcagtgct 


tctgtttaaa 


acaaacaaac 


aaacaaagtc 


82980 


aagacaacac 


tctgtagagc 


aaaaataaag 


cagaagacct 


ttgacttttg 


gcatatctaa 


83040 


cttgagccag 


aagtgcgact 


acagcaaaaa 


aatggcctat 


tcaagctgtc 


tgcaagctgc 


83100 
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ttctgggcta 


tctttctatt 


tgcagctttg 


cattgctggc 


tttcctcttt 


ttcttctttc 


83160 


tttctttttt 


tttttttttc 


cccctgctga atgatttgga tacttgagaa tcacccaaca 


83220 


catcttgcat 


cttctctaat 


ttttttttct 


tttctatttt 


tttaaatttt 


tatctggata 


83280 


cctgcatact 


tcaggtatgc 


agttttctgt 


gggaagacat 


tgtcatctag 


aggcaaaaat 


83340 


gtatataaat 


aataagaaag 


acacaataat 


aatctctttt tcaaagatta tctgaatcag 


83400 


cttctgatag 


ttgatgtttc 


caaagccaaa 


ttttgtctct 


ttcagtcaag 


aagaccctca 


83460 


gaatttctaa 


aacgtttctg 


aattgttgac 


ttcatgttaa 


agagaataag 


ctctgaacag 


83520 


gtttggctaa 


ttcacaatct 


ttattctgct 


ttacagaaat 


atctaatgtg 


cagtaaggaa 


83580 


ctatacagag 


cagcccttga 


aacagtggac 


ttccaaaggg 


ctcttgaagc 


aagcaggcta 


83640 


aaaattaatg 


attgggttga 


aagcgagaca 


caaggtaaaa 


cagagcaaaa 


ctgtagctgt 


83700 


gctatcttct 


ccctcttcca 


gtgctccttc 


aaaaagaatt 


cagcatatga 


taagtcttgt 


83760 


tcatgtttct 


aggtttctca 


tgcccgtcaa 


agatagtttg 


ttgttcccaa 


tcattcttta 


83820 


gagtcatcta 


ccagctaaac 


tatttctgag 


ttaaagatgt 


gtttgttgtc 


acatactgtc 


83880 


atactcctac 


ccacatgcct 


agcaagataa 


ctgcaacagt 


acctctaagg 


gttaaataga 


83940 


ttaattgctc 


ctgcaaatag 


ccaacactgc 


aggtacagta 


aagcagagga 


cggaagttat 


84000 


gagcgtcaca 


gtgagactgg 


gaacagcata 


gcagagagag 


aagacacctg 


aggacctggt 


84060 


gttgacctgc 


tctggtcgta 


cacagagcaa 


tgctaacaaa 


gatgagtgat 


gtgcccacca 


84120 


gagagatttc 


actgttacaa 


gtaacaacca 


accagctttt 


gccctttaca 


ggcacataga 


84180 


ggtcattggc 


tttttttctg 


attaagctga 


acatgaaata 


tgccactttt 


attttgtcag 


84240 


agatgcaaca 


tcagcagggt 


gaaaacctta 


taaatcttcc 


agctgaactt 


aagccagaac 


84300 


ttactgaggg 


aaattactga 


tggatgaata 


gatttgaagg 


cttctgattt 


cttaatggtc 


84360 


atatcctgac 


caaacctgtc 


cttgggctga 


cagagcagcc 


tgtgactaat 


gtgggaaaga 


84420 


gctgcaaacc 


ccagaccatc 


attgctctgt 


gtgcctgtac 


aaagcctgcg 


cgcttgggaa 


84480 


atcctacttc 


acctctgtac 


agaaaaaaaa 


agggtaaagg 


gaaagatgcc 


ctcatgtaaa 


84540 


ctgaaacaga 


ggattaatgg 


cgctgcgcct 


tttactgtgg 


acaggtgcca 


cctggaacat 


84600 


tcattttgcc 


actgatccca 


cagtaggcta 


atttgatgat 


cggtgcccct 


tcctctccct 


84660 


aacaggccag 


tactaggtaa 


cagtgctgag 


aaatttacca 


tttctttgct 


tgtatcgtcc 


84720 


ctgttctgtg 


aagaaacaaa 


cagttggatt 


tctaaggtac 


tctaaagcta 


agttcacaga 


84780 


caagtaattg 


agtctcaatc 


cagagcctta 


ataacaacta 


ataaacacct 


gtgttttcca 


84840 


aaatttcctc 


caggtaaaat 


caaggaactt 


tttgctccag 


gagtgattga 


ctcacacacc 


84900 


attctggtgc 


tggtgaacgt 


gatctacttc 


aaagcatcct 


gggaacacaa 


gtttgaggag 


84960 


aaaaatacag 


tacagagaga 


ttttaaactg 


aatcaggtag 


atatgcattg 


tataaatctt 


85020 


agcatgattt 


acctgagtta 


gcatgattta 


catgagttgc 


aacgactcag 


cattttgttt 


85080 


caatggctga 


caaaacacaa 


agcttcagcc 


ctgatcagcg 


cttttgaacc 


taatagtcac 


85140 
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tatgggcagc 


tgtcatggat 


agaagccaat 


tgcaaagatc 


tcatttcaca 


caggctctgt 


85200 


ggggccatcc 


tggcttttat 


gcatcccgta 


caattcagcg 


tgagccatgc 


aacagatagg 


85260 


ttaaaccaaa 


ccaatcaaaa 


aaagaggcca 


gatattaaca 


agccacatat 


atgaagatgg 


85320 


aatttgaaac 


aggaaaaatc 


ctcacagagt 


gttttggttt 


atttatagta 


tctgcaatgt 


85380 


ttaaaaggtt 


ttttttaaaa 


tatttttttt 


attttgattc 


cttttttcca 


ccgtacatat 


85440 


aaaatggaag 


ttttcattgc 


tcaactaagg 


tacagaatca tagaattact 


caggttggaa 


85500 


aggacctcaa 


agatcatcaa 


gtccaaccgc 


agcctaacca 


tagtacccta 


actctaacaa 


85560 


ccatctgtta 


aatcatatct 


ctgagcacca 


catccaaacg 


gctcttaaac 


acatccaggg 


85620 


atggtaactc 


aaccacctcc 


ctggggagcc 


tatcccagcg 


cttaacaacc 


ctttctgtaa 


85680 


agaagtgttt 


cctaacgtcc 


aacctaaact 


taccttggca 


caacttgagg 


ccatttcccc 


85740 


tcgtcttgtc 


acctgttgcc 


agtgagaaga 


gacctacccc 


gctctcactg 


taagcacctt 


85800 


tcaggtactg 


gaagaaaata 


ataaggtctt 


ctctcagcct 


cctcttctcc 


agactaaaaa 


85860 


gccccagctc 


cctcagcttc 


tcctcgtagg 


actgattttc 


caagcccttc 


actagccttg 


85920 


tgaagctgca 


aaaagttctt 


taacaaccac 


attaatccaa 


gctctgtaca 


gctcaagtct 


85980 


aacaaatgtc 


ttcaaaaaag 


atgatcaaaa 


ccattttatt 


tcatttaatt 


cagttttgtc 


86040 


ttcattccat 


atgctgtgcc 


tatgttacac 


taaataatga 


agccgccaaa 


aaaatgaacc 


86100 


cacaaaaaac 


acagatttag 


ctctgatctg 


aagttgaaga 


gctttgtatg 


ggaaaaactg 


86160 


tattctaagt 


gtttcttatc 


tatacaaaca 


aaaggtcaga 


aagacatctg 


ttgctagccg 


86220 


tagtgttgca 


ctgccattta 


ttaagacacg 


taagaaagtg 


taattttggt 


cccttaattt 


86280 


ttttacttga 


aatatgtctt 


tgaatttgaa 


tactgaaaac 


tgaccttagg 


taggaacatt 


86340 


tggaacactg 


ctgcagtcac 


agaaactatg 


agattggggg 


aatctgcata 


tacttttctt 


86400 


catgcactaa 


ttaataatgt 


tctctactaa 


aattcttccg 


ctgatttaga 


aggtaagtaa 


86460 


aaacttagct 


aatggtgaaa 


tgaaccttga 


gcctttacac 


aggatttgaa 


caaactcatc 


86520 


acaaaagaaa 


atgaggctta 


gaagacctag 


aagaacatgc 


ctgagattgc 


tcttaatctg 


86580 


tctattgctt 


cctgcctaaa 


acatctacct 


gataaatgac 


aacctgattc 


ctgcagtgct 


86640 


atttcttctc 


tatcccattc 


caaaccagga 


cttgcaaatc 


ccatcagcat 


cagcttgttt 


86700 


ggctggagag 


taatggtatt 


aagccacttc 


actatctgat 


cagttgcagg 


gaaattgctt 


86760 


tgttttattt 


tgccccccag 


agaattatct 


cctttataca 


tgaatggcaa 


aactgatgtt 


86820 


ttacgtgtcg 


ttgtatgtgc 


aacaaaataa 


agaaaaaatg 


tttagcttta 


taacaattac 


86880 


tgctgcaaac 


acagactact 


gatattgcac 


ctgaagttta 


aacattaagg 


tctgtattgc 


86940 


ttgtgtgatc 


attccaattt 


ctttttaaat 


agaatgagag 


aaagccagta 


cagatgatgt 


87000 


atcagaaagg 


cacatttaaa 


ctaggctata 


ttgaagagct 


gggaactcag 


gtgcttgaac 


87060 


tcccttacgc 


tcagaagttg 


cttagcatga 


tcatcctgca 


ccaggagaga 


cagcagatgg 


87120 


atctcccagt 


ggggctggaa 


caggtaaggg 


tgaggactgc 


ggctaagccg 


gactgaaagc 


87180 
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tggttgtctg 


aattaaagct 


gggcaaaaat 


ctaaacttgt 


taatttcccc 


atcttctaga 


87240 


ctgaaagcac 


aatgacctat 


gaaaatttaa tgctgtggtt 


ctcttccgaa catatgtttg 


87300 


agatggtggt 


agaggtgtac 


ctgccccgat 


tcaagctcga 


aggcaccttt 


gacctcaatg 


87360 


aggtattaaa agcaatggga 


atgactgaca 


tcttcagtga 


atccaaagct 


gatctttctg 


87420 


cattgtcatc 


tgagaaatcc 


ctggtgttgt 


caaacattgt 


ccacaaggct 


tatgtggaag 


87480 


tcaatgagga 


gggtactaca 


gcagcagctg 


ctacaggagc 


taccattgtg 


aggaggtctc 


87540 


ttcccctcat 


agaggtgttc 


atagctgacc 


gtcctttctt 


attctttatt 


aggcacaatc 


87600 


ccaccagtac 


cattcttttc 


tttggtaaat 


tctgctcacc 


ttaaaatcaa 


ggccatcttc 


87660 


tagcattgtg 


agaaaaacct 


ggatgaatca 


gaaatactat 


ttttccccct 


acaccttctt 


87720 


attcctatga 


atgattgtag 


atcaaagtaa 


tcactgcagc 


caacctagcc 


tagaaccatc 


87780 


aattgaatgc 


cctcctgtta 


tgctccttga 


atggcaaata 


ttgatctgaa 


tctaaaacag 


87840 


gagtaagttt 


tcccttaacc 


tgactggaaa 


tcaagaatat 


tttgtttctt 


caaggcgtac 


87900 


atacactcct 


gtatagccaa 


gtatgtccgg 


catagccaag 


taatgtagta 


cactatttgc 


87960 


ctggcaaagg 


tagaatttgt 


atgctgctac 


ctgaggagaa 


ctgtttgtaa 


caattttcag 


88020 


taactgccag 


taaaagtgga 


gtatttttat tttctctgta 


gtttttgatt 


tcctgccagg 


88080 


tgggacttga 


ttaacagaga 


ggggctttgg 


aaatgcttta 


tacttataca taatctgtat 


88140 


ttgtggcaaa tccttcgcac 


agtggagatc tcactttgat 


aattcccttt 


cctgtagcag 


88200 


cagtcacaag 


caagcaggaa 


atacttattt 


acagcaaatt 


cacgtgttta 


ctgacaactg 


88260 


taccaccttt 


ccccccatga 


tgtatgctgg 


atctatcctt 


ttgccatata 


aaacgtttat 


88320 


gctagaagca 


gctttggttt 


catttattta 


tttagatata 


agcctgcatc 


tgaagcacca 


88380 


actcatcaac 


tggaagatag 


atggaatatg 


acatataccc 


ctttcacaat 


cccttggttt 


88440 


tttccacatg 


agttctgtta 


gaagcactgt 


atttttcctt 


ttttaagata 


acaacagtag 


88500 


gaacactcat 


ggaaaggaca 


agattacgcc 


tcatgaacac 


atctagtaag 


agagttgatt 


88560 


ataacagcaa 


ctgagtatgt 


gggaaggcaa gattttgacc 


ctcgttttac 


aggatttttt 


88620 


ggcactcttt 


tttgaaaata 


aatccaccct 


taaagaatca 


cagcatggtt 


gatgttgcaa 


88680 


gggacctctg 


gaggacattt 


tgtccaactg 


tcctgttcag 


gcagggcaac 


catgtccagg 


88740 


gggcttttga 


gaatccccaa 


gcacagaaac 


ttcacaacct 


ctctggacaa 


cctcttctga 


88800 


gttcccacaa 


ttttgaatga 


caccaaagag 


aattttgtat 


gcgcagtgtc 


tgcaggaatg 


88860 


ggatgtgaaa 


acacacattt 


ctaaagctta 


attacttaca 


tagtgaagta 


attggttttc 


88920 


ttccttgagt 


tctgctctct 


ggtgaagttt 


aatgatctga 


gatgcatgta 


tatagatata 


88980 


caggtctctc 


cagccctgag 


gaatgaagaa 


aagttttgaa 


aagggcaatg 


taagcaatag 


89040 


aaatcacagt 


caaatattac 


ctggaaaact 


ttttagtctg 


agagataatt 


agaaaaatag 


89100 


aattagcagc 


tgactgatag 


agagacataa 


ctgttaagtt 


gctggtttaa 


cacaagtaat 


89160 


atcttcctca 


cagagttcta 


tgtgaggttt 


aactaactag 


cgttggcaac 


ttgtgctttg 


89220 
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tgacctataa 


aaaggcaagt 


atacattagc 


tattagtcat 


ataattgagt 


gtaaagctcc 


89280 


ataaagtaat 


tcatgattag 


cacagtttat 


gtaccaaaag 


ttacctgcgg 


ctctttggat 


89340 


aagaaagtct 


aggcatgatg 


ttcgagcaag 


aacaggcagg 


agtaggacaa 


taatattcaa 


89400 


acaacttacc 


cttactgact 


aatctgaaag 


cacagtacaa 


tgtaagcagt 


acttttccag 


89460 


attgtgtcca 


tgtttccatt 


ctggaggctg 


acagcacaga 


ttgcctacta 


agctatgttt 


89520 


ttattacctc 


caggtgtcat 


cacttggttt 


ttacataccc 


tggggaagtt 


ctgagcacca 


89580 


caacctcaaa 


catcagtccc 


acttctgcaa 


cgacaggaac 


agagattcct 


gtgatgaagc 


89640 


gtcgaataac 


acagtgtctt 


gctccagttg 


ttggaggaga 


tggttcatga 


taaatctaga 


89700 


gtgagattaa 


gacacagatg 


aggtcaaatg 


tcatccagct 


agtttatgac 


aaattctaag 


89760 


cagttaagga 


atgtgggaaa 


catggcaaag 


ttagcaacag 


taaagggagg 


aattctagca 


89820 


aactggctat 


agagcaggga 


tactcacccc 


catggatcta 


gcagtatccc 


attggtttgc 


89880 


aggaggttgc 


aggtcagtca 


aagacatatc 


actgatctgc 


acagctgcag 


ttcagtggag 


89940 


gattgtctct 


gttctaccac 


tgaactcttc 


aggctttatc 


ctcttcattc 


tgctctcatg 


90000 


caccttcagt 


tactcagggc 


caatggcatg 


tgtgcctccc 


attgggtgat 


cggctgttga 


90060 


tcatgcagca 


atcacacacc 


tgccacctgg 


cacgctgttc 


ggcatgtgta 


ctgacttaat 


90120 


ggaagagacc 


ttttaagctc 


atctagtcca 


actcccctcc 


actgaagagg 


gacacctaca 


90180 


gctagatcag 


gttattcaga 


gccccgtcca 


gcctcctcaa 


tgtctccagg 


gaaggggctt 


90240 


ctaccatatc 


tctaagcagc 


acattccagt 


gccccaccat 


cctcactgta 


aaagaatttt 


90300 


tctttatatc 


caagccaaat 


ctcccttcct 


ttagtttgaa 


actatttccc 


cttgtcccat 


90360 


tacaacagat 


cctactaaag 


aatctgtctc 


cttcttctta 


agagctccct 


tgagaaggga 


90420 


gctcttctca 


ggtcaccttg 


gagccttctc 


atatccagac 


tgagcagtgc 


tagttctcag 


90480 


cccgtccttg 


taggggaagc 


attccatccc 


ttggattatt 


ttcctctgga 


ctcacttcaa 


90540 


cgtccatgtc 


tcctctgtac 


tgaggactgc acatttggat 


gtagtactct 


aggagaggcc 


90600 


tcaccagcat 


agagcaaggg 


acaggatcac 


ctgccttgcc 


ctgctggcca 


tgcttctttt 


90660 


gctgcaacct 


aagatacggt 


tgactttcta 


ggctgcaagg 


gcacactact 


gactcacgtc 


90720 


cagatgccat 


ctaccacagt 


acccctaaat 


ccttttctgg 


cagggctatg 


ctccctcttt 


90780 


tcgtattcca 


gcttgtaaat 


gtagtggggg ttgccataac 


ccaggtgcaa 


gaccttacct 


90840 


ttggatttgt 


tgaccctcat 


gaagttctct 


cgggcccact 


gcttgagcct 


gtatggatcc 


90900 


ctctgaatgg 


catctcatcc 


ttcaggagca 


tccactacac 


catacagctt 


ggtgtccttt 


90960 


gcaaacttgc 


tgagggtgca 


tcaaaatcct 


gttgacaatg 


ttactgatga 


agacactaaa 


91020 


gagtactgat 


cccagtactg 


atccctaagg 


aacactactg 


gtcactgatc 


tccatccaga 


91080 


cattgagcca 


ttgaccacca 


ctctctgggt 


ttgatcccgc 


agccagtttc 


tagtccacta 


91140 


gtcagcacac 


cactgatcat 


agccacactc 


gaaggggcag 


tcatgcaagc 


accaccctgg 


91200 


gtatttattt 


cccagcactc 


taaagcagag 


ctcttgctcc 


agctcatgtt 


attttctgtg 


91260 
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tggcaaggag 


tgagattcat 


cgactctagc 


aaatggaact 


aatggctcca 


tgtgccccag 


91320 


gtctcagctc 


agcaccagcc 


aggccagggc 


tgagtccccc 


cacatccaac 


ccataaggtc 


91380 


ccagaggact 


cctacgttta 


ccagtggtgc 


acagagatga 


gtttagccca 


agtccacccc 


91440 


tcagcctcaa 


ctcccttcaa 


cacctcttca 


ccaagaggcc 


caatccatca 


ctccttacca 


91500 


gccaaaacat 


atacttgttt 


aataccacag 


ccacaaaagc 


cacgtggtaa 


ggtctgaaga 


91560 


gaccaaaact 


gtggtttgag 


taaaacagaa 


ggaaagcctc 


tactcagtac 


cccacttatg 


91620 


actgagttac 


taggatagga 


cctgattcta 


cagcacccca 


ataccctgta 


gatgtattcc 


91680 


tttaattctt 


cacaccagat 


taaggctgct 


gccaccaccc 


accacaaata 


aatccttgct 


91740 


taggctgatt 


ataacttaca 


cctgtggctt 


ccacagtcaa 


atgagattcc 


cagtgcccac 


91800 


ctgcgtgttc 


aacttcctta 


aggcaaagca 


tcttgcagtt 


agcagagtgt 


taagaaatct 


91860 


tcttgtattt 


cctttaacac 


acgtttatct 


tccccagtga 


tgctgaattt 


gcaaatgctt 


91920 


tagggaaaaa 


ttggcagcaa 


gtccttacat 


aattactgtt 


tagcctagaa 


aataacaacc 


91980 


gaggtagaat 


acttcagaaa 


gtttctaatt 


taaggttttt 


ttcttgatga 


gagaaaagtg 


92040 


ctatcagagc 


tgtttagtaa 


ttccagtcat 


gcatgggtaa 


ctcattcttc 


tgtgttaggg 


92100 


tttactgaga 


ggtgaagaaa 


caagtagttt 


cttttcctta 


tgaaaaaaaa 


aaaaagtggt 


92160 


attagaagaa 


ccccataaaa 


gaatgccaaa 


cattgcagct 


tatgatgtgc 


aatgtgtcac 


92220 


tcagtcttac 


agatgacaca 


gcctggaagt 


aagcttaaaa 


aaaatgttta 


attcctaact 


92280 


tcttttgaca 


ccatctgtgc 


tgtggtttat 


gacatccatt 


aataatgttt 


atcactaaac 


92340 


aacaacagat 


agagagacca 


gaaactaagg 


atgctgctgt 


catttccttc 


tgatgcaagg 


92400 


tagaaacatc 


aggaaattaa 


ggcacactga 


aatattttgt 


aatattttgg 


actagaagca 


92460 


aaaccagaaa 


ctgagttgca 


tttgtctcct 


ggagtacatt 


ctacaggtat 


ttaaaaagag 


92520 


acaaaaacca 


taaatctact 


tgaatttaat 


ttgaagtatc 


aaatgaaaaa 


gatgtacctg 


92580 


attttattat 


cctccacact 


ggtcttctga 


acttgaccaa 


tcccactggt 


cagttactgg 


92640 


tttacgactg 


ctcaagctgt 


ttgtagcaac tatgttgtac 


cacaaaatat 


ctgagccatt 


92700 


acaaaacaga 


agagtcatta 


ggcattttat 


ctccaaccca 


aagcatacat 


gcatgtttta 


92760 


aaatctcaaa 


ttctcctgac 


tttaattgtg 


catattatgt 


tcaccaaacc 


ttttagaacc 


92820 


tgccttgttt 


tttttgttct 


ggtctgtagc 


tgggagtcag 


agaaattcaa 


ctgtgattgg 


92880 


aaaaatggtt 


actggcaagc 


tatagagttt 


ctaagccaga 


aggtgaagaa 


atactacttt 


92940 


tttaacactc 


ttggcctggg 


actagactta 


cagacatgat 


caatattgaa 


aggcaatttg 


93000 


gaggtataca 


ttttaacatg 


tcctcagtct 


ggagttagct 


gtgtgtccag 


tttcctctca 


93060 


gtgtgagtca 


agcaatagca 


ttagaaagtt 


atgccccaag 


tctcatcccc 


tcctattgaa 


93120 


acttggcaca 


gcacattcag 


gctgtaagcc 


accaggtcac 


agccccctta 


aaggattcgg 


93180 


caacagctgt 


ggttgctatc 


acatggtgtt 


gatcatcgtt 


gggcccctca 


ctgtaagaaa 


93240 


gacattgaga 


ccctggagcg 


tgtccagagg 


agggcaacaa 


agctgttgag 


gggtctggag 


93300 
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cacaggcctt 


atgaggaacg 


gctgaaggaa 


ctgggattgt 


tcagtctaaa 


gaagaggagg 


93360 


ctcaggggag 


accttattgc 


tctctataac tacctgaagg 


gaggttgtag 


tgagctgggg 


93420 


gtcggcctct 


tctctcgtgt 


gactagtgat 


aggactagag 


ggaatggctt 


caagctgcgt 


93480 


cagggaaggt 


tcaggctgga 


tgttaggaaa tactacttct 


ctgaaagggt 


ggtcaggcac 


93540 


tggaaagggc 


tgcccagaga 


ggtggtggag tcactgaccc 


tgaaggtgtt 


caaagagtgt 


93600 


ttggatgttg 


tgttgaggga 


catggtttag 


tgagaaccat 


tggtgaaggg 


cgaacgaatg 


93660 


gttggactgg 


atgatcttct 


gggtcttttc 


ctaccttagt 


gattccatga 


ttctatgatc 


93720 


attacactgg 


atttgatact 


ctgtgagcaa 


aggcattgaa 


gtggtacaaa 


aaattcaaca 


93780 


ttctgcatta 


aattgtagaa 


tctggcaagt 


ggaaatcgtt 


ttctataggc 


acagccacgc 


93840 


actcagaatg 


tgtttgcaat 


ttgcttgcat 


ttagtcttct 


gcaagtaatg 


actgctttct 


93900 


gtatgcaaat 


gattgatcca 


tgtgaaaaaa 


tctgcttgtg 


tatctgtgaa 


tcaaatgcat 


93960 


tgctttataa 


tgtgcatttt 


ggatcattta 


tttgtggaag 


taagtgtaaa 


aaacagagcc 


94020 


tgcaattgtg 


cttctgcagt 


atacaaggcg 


ttactcaact 


ccagctgtac 


agtcagtcag 


94080 


gccctgagat 


aatctagact 


tatactttcc 


atagttatta 


taattttgtc 


tcttactaaa 


94140 


tctttgattc 


tgcttgtttg 


ataaagtaac 


actcattttc 


tatatagtat 


tacaatcgct 


94200 


tctagaaggc 


attacatcac 


tgaattcata ggctttctga 


aaaacagatt 


cagaaatcag 


94260 


attttctaac 


tgtatttttc 


catgtatatg 


tattggagaa 


ctagtgaaga 


acgtgtttaa 


94320 


tatacagaac 


tacagataaa 


tccagaaagg 


agaagcaaca 


ctcaaaataa 


ggatgtggca 


94380 


atcctaaata 


ggctgtaagc 


tggcttgaag 


catgtccctc 


caaaaaagcc 


atctgagaga 


94440 


aaatttctca 


tttaccatgc 


atgtgcaagt ttccaaactc 


tgcaggtatt 


ttattttctc 


94500 


cttttgcaaa 


ttcccttgca 


gatggcattt 


tgctttgctt 


gctctgaact 


gcgttgatgt 


94560 


gagcagtgag 


gtgcttttct 


catgctgaaa tacaagaata 


aagaagattg 


aagcacaggt 


94620 


ctgtgcagaa 


catctagtga 


atgtattcag ggcatgccaa 


gcacaagcta 


ttcaaatatt 


94680 


gctccctgaa 


aatgcagtca 


gagtggactt 


catgttttta 


agtggaagtg 


gtacataact 


94740 


tctgtagtgg 


agaaatcgtg 


tgactcaggg ggtgaagggc 


ctatcctcag 


ttaatcccat 


94800 


attcttgttg 


caatatgggc 


ctgcatcttc 


cagcactgtc 


agactccagg 


ttttagcata 


94860 


agatcagtgg 


aaaaaaatat 


acacaaatat 


accccttgct 


tctgaagctc 


tgccctaatt 


94920 


gggatgattg 


caaataaatg 


aaaaaaaaaa 


aaagggaaat 


tcaaatactg 


atgataactc 


94980 


tgcagttcaa 


caaccaggac 


acctagtagg 


tgagttctgg 


cttccagtcc 


ctgctgctag 


95040 


gactattctt 


gttttaatgt 


ttaagagaaa 


acaagtattc 


acacatgggt 


gagtacccta 


95100 


gcaataatga 


cagagaacta 


ttctgctcta 


tagcattctg 


atagtatgaa 


tctcgcctta 


95160 


attccatagt 


cttctcttag 


taccacgtcc 


cccagctcct 


gttgtctgaa 


ttaagcaatc 


95220 


actgtgtgac 


acctacgtcg 


gagcttagct 


ccattacact 


caatgaaatc 


agttgctggt 


95280 


cttctgtgga 


aaatatacta 


ttgcgccctg 


agcagtgctg 


agcacagcac 


ctgtttgcct 


95340 
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aatattaatg cagcactcag accacaacca gcctcaagac actcagcaga aggaatatta 95400 

tgaaaacagt aggtgctgct cctgaagcat aacagcctcc agagatggaa gacaagaaga 95460 

tgtgctttgg tagtgtgtgg tgctcatttc cttgttcatg aatgatgatg ggaatgactc 95520 

tggaagacac accagaggcc tctggtgtat accccatgcc tccagcctgg gcaactcctc 95580 

cttgctgcct ttttgacttg ttttgtgcaa gccatccatc cagaggtgca gagtgaaaac 95640 

aaccatggag ctcaagaaga gcctcatcag gtccatacac acttcaaacc cagagcaaaa 95700 

cattggagcc tcgggctcac tgcacagttc tgctgaaaac tgtgatgaag agctaggggt 95760 

tagaggaaaa atgtgctgta gttatcagtg cagctccatc atctgttccg ggagcatcaa 95820 

ggcttcctgg agagaacatt atcagaagga cacaaattat tcagtgagag ggagaagtgc 95880 

gcctctgaac gctctgagtc agatgcttat tttgtgaatt tttctgtttc cctcttcctg 95940 

ttatgcttcc tgcagatact tggcacatcc ttgaggcgat tcagcaatat atgctcatat 96000 

tcagccacat ctacagagtg cctcctccct gagaggagaa aaatatttgt tttagggggt 96060 

aaaaccagaa tagctgtgct tggacctcct gctctgctgt gggacaagag aagctaggct 96120 

cctggtaacc tcaggaggca gagggaggca cattataatt tggctaagac ttgaaaatgc 96180 

aatttgttgg tatatttggt aaatatactg atggcctagt cccataaact accttctaga 96240 

tgtggagtaa gtggtttaaa ggcatagcta agaggttgca gaaaagaaag gaccacatcc 96300 

aatttggtag caaccaacat ccagcattca cagactcatg agaaatacct tttaattaat 96360 

ttatttatat taaataaaaa aaaaaaatcc tttgatgact caccctgctt ttcctgttac 96420 

tctcagttgg gaagaaagta accgctgggt acatactact gcaatttcag agctgcagac 96480 

ttgaagagct ttcccaagtg ctgagatatg caggaaaaaa aaccctgtaa attacagtac 96540 

caggcattta attttgattg ctaaataaag aagactcgtg acagtccatg actacgtctt 96600 

ggagggctgc aattacatat gaaatatagt ctgaattagg agagttactg gcagaggcaa 96660 

agtttgcatg ccaattaatt ggtaaaagga gagtacgcca aacacaggct gtggactgct 96720 

ctgatgaact gagtatgtaa aaaatagcca tgtgtgtttt tcagtgaata ccatggtata 96780 

tgtctggttt gagtcaaata tgtattaaaa tgaaaaaaaa aaacaacaag aacagtgaaa 96840 

taaacagtgc tagcatatat tagcttgtat aatcagacct atatagtttt caaataaatc 96900 

ttcaaggaga acaaaatgta tagtatgtat gataaggata agtactataa aacatcatca 96960 

tgaggagtgc cagtctgaca acaggaaaag gaattcagcg tgtgaatgaa ggggaaagtg 97020 

tgactgaaac aattgtcact cagcttacta cagcagaagc aatcatttat gatcttagat 97080 

ttttttttat tttttttttt aacttgcttc agagatatct aagtaatctc aaaaacagga 97140 

acaaaatacc aacgcaagga aaaattctat tttcgcttca tataatcttt tctttttttt 97200 

tctagttgca ttcttaccta aaaacaacaa caacaaaaca tttaaacaat gtttaaatgt 97260 

ttactgctgg tttgattaca tcaaaccgag ttgttgctgg agatgaccag ctatcaaggt 97320 

gcataatgga ctggcagatg tgcttggtct taccccaggt tgctgtgcaa acacaataca 97380 
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cattgacata taagctacta tgagttctga agggcagttt agacattaat tctactccag 97440 

gccagacacg ctgactatct gagtggttta tagcaaggga ctggttgact tcaaagtggt 97500 

tccaagtcaa ccactgccaa gtgcttaaga ctgtgtatgc acaacagagc tgatcatctc 97560 

cagtgcaaca aataacatga gagcaaaaag catctgaaat tctgtaaatg aggctgttct 97620 

ggccacacct tggctcatta aaagactttg agagatgcca gaatagcctc tgctaaatgt 97680 

gatgcagatg gacaagctat ggaatgaatg ggtccagggc ataaggaaac attaccctca 97740 

agcactacac aggagctgct gaacaaccac aggaaaggaa atgtgaaaat gtgaacagat 97800 

aaatgttgga aagagccgca tttctgctgc ttactatgtc cttgattatg ccaacattaa 97860 

ggaagaatgg caaaccccgt gaattggttt aggaacagct ctacaatgga ctgcctgacg 97920 

gaggaaaagg gcagcagagt ccttgctgac ctctttctgg tacaaacaca gatctggaac 97980 

agagtttaac caattagtct tgcttgcatt catgcctctt gaatttcaag aggtgccttt 98040 

gatttcccct ggcctaacac cccatctaaa attacaaaac catattttgt ctgctgagga 98100 

ctgtgcacgg atagcccgtt ctggtcaaca tactcaggct gcttctgcaa caagttttgc 98160 

actggcattc agtgtagaaa aaatgcaaga cctgtgtagc ggcagacttc tctctggaga 98220 

acatgtattg cctcaactat cttacctgtg caaaactgtt gtggtgactg tgctattgca 98280 

gaggtagagt gttcaaagaa ggcaaacgta ctgaatgaga gaacacatca aaaacacctt 98340 

catgccctct tctaggggag acagcgaaac aaaatgttta ttgagaaaat cttggacatc 98400 

agtccaagag atgaaaacac tgtccatatg tgcagggctg gttgtgttct acaggtccat 98460 

gctgcataga tgaccacaga ggacaaagac attgaaacca agcatacaaa gggctgtggg 98520 

tacccaggaa agttcttcaa ggaagccttg aagggatgtt tgagtaccca cctgacctgt 98580 

agctgcaacc ctgatgtaaa catgtgaaaa tgggagcata agagaagaca ctacacactg 98640 

caacaaaacc tgtgcccttg gggaggaaaa gtttgacaag ataaagtaga agctattgaa 98700 

aaaggaacat taaacaagac aggaggaaag cttcttacta tctgtagatt tccctactcc 98760 

cgacatgact actgtcatgt tgacagataa aaaatactca ttttgagtgt ggaaactgaa 98820 

agccattcca gttatcatgg tctgcacata cacacatgac tgaatttcag caacacaaaa 98880 

cacagtgctt atgataaagg agctcccttt tacctttacc agtgggtacc accaccactg 98940 

tgtactgtct gtcttaatgt gcaaaaattt gggatttcta ttattcattc ccctggcctt 99000 

aacagaagct ggattttttt ctttagtgct catcaagggc attattcaat aaagagtaat 99060 

agctttttac aattgactaa tatttgatat tgtgcattat gattgtctaa cagaccatga 99120 

atgttccttc agacagattt ggtagtttat ttacctgtca tagtaaaata ggaggtacag 99180 

aagatctatg agaatagcct gtgcatgtac aatgggcctt gttgccatga cctatgaaga 99240 

atgaaaatca aaagctgacc accaatcatc ccttgaattc cactggctgt tcagcattca 99300 

cttctgaata tctgaatact ctggagtctg ccttcgcaaa gcagcaaata ctttcagact 99360 

gttccctaaa tctcttcctc ttacctattc acactgagtt ctctaattca tcccaacacc 99420 
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tctgctctga attttttcat aagaagcttc agcaaaatgt gctttctcct ctcaaatgta 99480 

tgctgcagag cctttggctt acagtggata tagcccaaat tccagtgaaa aacttcagtc 99540 

ttgcctaggt gcagaaatag atggagctgt gcttttaaca agtactaact ataagcttct 99600 

tcagttctca aactctttca gcagaccaaa acatttttca gtacagtttt gttctttaaa 99660 

aaactcataa agctttgttt ctattcttac atggaaagca atccattaca aaatcctcaa 99720 

aatagaatga ccatcctgca gctgactctg cttggaactg cattattttc tctacatcaa 99780 

gtggttgcca tccatgagaa gcatccctat gtttctctgc acactgcagt aagagatcac 99840 

gtatatatca cacttttccc ttcacccatc ttgggagcag tgctacagta aattgtataa 99900 

ttacagtgcc ccagagatga gaagaaactg aacagcagga aaggagacac agtcttaaaa 99960 

agaagaatgt tttccaggaa ttgatgcact ttcttgcact ccttggtaat atgggactac 100020 

tcttgcctca cctttagcag tgggtgctca ttaaatggtg aatggtggtg ggtcttctgg 100080 

ttctccaatc atgtcttatt ttctcataat attttgggat ccttagattc atctgactgt 100140 

gagaatcact tgatctgatt ttttttttta atctgatttt gcagctaagt ttatctgaag 100200 

tgtattatgc ttatcctctt ttttaagggt tttttttttt tttaaagtgt gtgtattcat 100260 

tattcgtttg gctctagtta tcgatatggc tcaatcaaat taatgtttaa attctgaagt 100320 

agagcatgag acatgctaga cttgaagttg gtacagcttt ataagataca agaaaagcct 100380 

gaataattac attctactat taggtttcac ttcacaaaat aaatttggct ttctccaagt 100440 

agagtaccag tctaatgttg gcctactcag tgctttcaag cacaatgaat caaaaggcaa 100500 

tgacaaaggg tagtaactca aaggatgact cttagaaggc taacaggggg agtgtccgaa 100560 

agggtactgt atatatcacc aaggactcag agaatctgtt caggttcaac tggcaagctg 100620 

gattattacg agcctctttg atgtttttct gtaagtactt ctccaaataa aatgtaactt 100680 

ctaagttgta ttcttgaata tggaaaaaac aaaacaaaac agaaatatat tatgtaagaa 100740 

cttagaggaa aaaagggccg ccttctattt tatgatgttg gcccaccaca tcagaggcag 100800 

atggtggtgg tatggcagta gaggttgaac cttcccacca acaccccgtt atgtgttgtt 100860 

gctgtgtgac agatggcagc agaggggcag tctgacagaa tggcgtctca catggaagtg 100920 

tgtatgaagc aaaggtgtgt cactgaattc ctccatatgg aaaaaaatgg cacccactga 100980 

cattcatcga tgcttgctga atgtttatgg agacaaaaca gtggatgtga gcacagcgag 101040 

gcagtgagtg gtgtgtttca gcagtggcga cagtgacagt tgttcacctc cactggtaca 101100 

gaattttgcc agcaggaaat gcagattctt gtacattgtg ggcaaaaatg catagctagc 101160 

tgtggtggct atgttgaaaa ataatgttcc gtggctgaga atctgctcaa ggaaataaag 101220 

ttattgtaat cattataata atattataca tgtgctttct atctattgta gtttacatga 101280 

aaataaatag gaggcattac ttttggtgtg atctgtatac aggacagata tgtaaaaaat 101340 

atttctggaa gagaaaattt ttgttttcac agtctcactc cctgcagaac acaggtgagg 101400 

tacagtagga taattcacag agccttgtta gcaccaggaa cctctcaggt tatgtagtag 101460 
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atcacatttg ctacaaacta tggatatgct attattccaa cttaaaactg ttttagaacg 101520 

gggagggcac tattcagctt tcttgttctc ggattaaaga aagagaagga ctgtagattt 101580 

caataatttc ccctaagtct tgacattaaa ttgcatgtac aagaccttca cctggctgat 101640 

ctgatgcagc tttacagtgc attaagtaat ttagccagac tgtgtattta cggtatatag 101700 

acgtttgttt gtttttgtca acaacaaaaa aaaggaatca gcagagatta aatgtcaaaa 101760 

aatgagaata tagagaagaa gcccactaaa gctatagttt ggcatctaag caactggcta 101820 

gatttacaaa gagattcact ctataaatta caggacagca acctccaatt ttatggccag 101880 

ttgtacaaag aagcagtttg aaacaagcta agactattgt ggtttgacta catttgattg 101940 

aaatatccag agtatggtcc agagtagaca cagaagaaat gaaatgtgtt tacattgtct 102000 

caaaaatcat tcagagttct ctggatggct atagggaaac tcatctagtc cacactgtat 102060 

tcatcatact gaagcacaga tgaaactatc tatttcctaa agggcaagta caagatagtg 102120 

tttttataat gaaccagtac ctttctgaag gaaagtaaac atgcatttgg gaaacaatgg 102180 

gtcagtcttt acaatatttc taatgatcac agaattttta ggctttacat tattgtttca 102240 

gcatcacaga aacagcaatg aacaagcagc ttctgggcta caggaagtac tttttactac 102300 

aagtgccaca cgtcaacacc acacagtaat aatcctgttt cttttagaca acaccgattt 102360 

caggatgggc tccatcagtg cagcaaatgc agaattttgt tttgatgtat tcaatgagct 102420 

gaaagtccag cacacaaatg agaacatctt gtattccccc ttgagcatca ttgtagcctt 102480 

ggccatggtc tatatgggag caagaggcaa cactgagtac cagatggaga aggtaagtta 102540 

tgcaagtaaa tacaagctca ttttgatcct ggttaacaga acaagttatc catgaagatc 102600 

tttgagactt tctcccctta aggggccagc tgctgtacat ttgccactgg atttgaactt 102660 

ggctagcaga aggacattga gccatgaggt ttggatctgg aactaacttt tcacttattg 102720 

cttttcacta caaagggtaa caacagtttc tactaaggag gagatctcct gcttcagttt 102780 

atattatctc acaaacctga ctccttccag ataaaatgaa caaattttca tgtataaaag 102840 

atgaaacact cagaaatcag gagtcacagt tctaagtaca gtatgggtgt agctggtttc 102900 

tggatggaaa aataagtgaa ctaattggaa gatcctatca aaaaatgttc agagcagcac 102960 

atgcagtaaa aaaacaaaca aacaaacaaa caaaaaaaaa ccacacaaat ttcaacctcg 103020 

aatgaaactt ctcagttcag ccattggtta tttcaagccc agaatttgaa cacaaaatcc 103080 

agagactctc agtgaacttt gcatacttca tttcttcttc tgctacttcc atttgcaggc 103140 

tcttcacttt gacagcattg caggacttgg aggaagcact cagacaaagg tacagaaacc 103200 

taaggtacat tatttttctc tcacattcac tttttttttt tttcctgaaa acttaaaact 103260 

gttctgactg tgcttccaat aggtccagcc ccttcccaaa ccctagctaa tgctctcaac 103320 

acatgatatg caaatgaaaa actaaaattt gttctaaaaa aaaaaaaaat aatgacaaaa 103380 

agaaggctca tttcacatgt tgcaccagaa aaagtgatag gatagttgaa ggacattttg 103440 

agcaccagga taccttccta cattgataag aacttgcaca cttgtagggc ttgctggagg 103500 
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accacacatg aaccatgtgt gcttttctcc ttggtcactt gatacatttg gaaagataac 103560 

acaagccatg ctcccagggc tgtcctcatc cacttgggtt ctccaagcac aatgtggggc 103620 

ttgtaaagga caagaagatt tttccgtttc cttttctttt tcctttcccc ctttcacttt 103680 

ttcctttccc cctcactttc tctctccttt tcccacttcc ctctttcctt tacatttccc 103740 

actctcctct cctctcctct ccactccact ccactcctct cctctcctct cctctcctct 103800 

cctctcctct cctctcctct cctctcctct cctctcctct cctctcctct cctctcctct 103860 

cctctcctct cctctcctct cctctcctct cctctcctct cctctcctct cctctcctct 103920 

cctctccttt ccattctatt ctttttgcta gagcatttag atggttatgt agaacaattc 103980 

acaaaacaca atcagacaaa tcactcacat tttctgtttc ttatcaccaa gactgagtgt 104040 

caccaaatgc tatcagttgt acatgcttat atagaacatc tctcccatgg agcttttaga 104100 

ctctaatgta ttttgtttgc aaatgtctga acactgtgtg ttttcctacg tgatctgtac 104160 

tttataaata gttgtctttc tagtaaaata agctaacatt tatacccttt ttcctcctct 104220 

tcaacaaccc agtgtggcaa atccgtgaac atccacctac tctttaaaga actcctctct 104280 

gatattactg catcaaaagc caattattca ctccgcattg ccaacagact ctatgcagaa 104340 

aagtcacgtc ctatcctacc ggtgagttgt acaacagagt gattttttgc tagatcctgt 104400 

ataaacccat aatccaggag tactgcccag agtatctgtt aatccaactc acctcagcgg 104460 

tgtggacttc cacagctttt catttgacat tctcaaaata aaacacacaa atattctaaa 104520 

tcaaatacat tttatcttta aaaatagaga aaaatgcttc aaaaataagg attttattat 104580 

aacaaaacag ttgctaatgg atgctaatgg acctgaagct gtttttggat tggtatttct 104640 

tcaagaaaat atttcagcat ttctactacg taatcttatc tggtaaagta ataaaaatct 104700 

taaagatctt aacatatcat gcatcgaaat aattttgctg gcccagtttt aaccatttcg 104760 

tccaggaaat aagccatgaa aacagtctaa tagcataatt ataaaaatca tggaacattt 104820 

taactgcatt ttatttcacc ttcacagtct ttttaaaaac tgacttggta gctacaactg 104880 

ttgtctttac agatttacct aaagtgtgtg aagaaactgt acagagcagg tctggaaaca 104940 

gtgaacttca aaacagcatc agaccaagcc aggcagctta ttaactcctg ggtggaaaag 105000 

cagacagaag gtaagctcag aggagagttt ataatatact tccttgttac tactttaccc 105060 

aaacaacttc tggaaagact attccttcca tctccattaa tggatatttc ctgtggaaac 105120 

tgatgactct tgcacacttt tttgtgtgcg gtgacagtga atttaaatat atatgacaaa 105180 

ggcagggatg ccactgtgtg ctttctgtgt aaggagagca taactcatgc aagattggtc 105240 

ccagcttccc tacaatattg gcatcatttt acaagcatat gctggatgga taagaaatgg 105300 

gcttccgtgg aagaaaataa tgtggccact aagttggtgt aagaaaagga atgattaaga 105360 

gtgtatgtac atttatcagg aaaaaggtgg gaagaaaaca agaatcaagt attagaagga 105420 

agcacagtga gaggcagaag atcggtatcc ctgctttgct tttcacttcc ttctgttcca 105480 

tgcaagtctt tttccaagga cgtttgagat attcctgggg atgtgtgtga acattcaagc 105540 
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ctacatgcct ccttacagaa atgcctggtt aagggttagt tgttctgtat gaaatcactc 105600 

gtgaacttga attccacatg ccatcattta aagaacagga agtcaactca agcttgctgg 105660 

ttgacatcta aaacaaaaca ctcctgcaat gaaaacaaaa ccccacaaag cagcaccctc 105720 

caatcccttt gcctcataca tgcaaaccag acagactgtg tcttagcact cactgctttg 105780 

cttccttctt acaggacaga tcaaagattt gcttgtatca agctccactg atcttgatac 105840 

aacgctggtc ctcgttaatg ccatctactt caaagggatg tggaagacag catttaatgc 105900 

agaagacact cgagaaatgc ccttccatgt aacaaaggta ggggacgtgg tcaccgcttc 105960 

tgggcaggac agaaagccat caagggtgcg acatacacca tcctacagtc attggtccat 106020 

ggttcttctg ggcccctcgc tgacagggca tggggctgag cccaagacag gctggcaaaa 106080 

attgtgtctg accaggcatc caaagcacac ctgtagacaa gagaggaaaa tggagacaca 106140 

gcttgaggat ccagcccagt tcctctgaag gacttgcaca tctgcctgct tcaagagaaa 106200 

ctgccccctt ctcacattgt ctcatgcttc tgttttgcag gaagaaagca aacctgtgca 106260 

aatgatgtgt atgaacaata gctttaatgt ggccacactg cctgcagaga aaatgaagat 106320 

cctggagctc ccatttgcca gcggagacct gagcatgttg gtgctgttgc ctgatgaggt 106380 

ttctgacctg gagcgggtac ggccctggca ggggaagcca actagttcgg agttcagtgg 106440 

gagctggctg ctgttagacc tttggctctg ctctcgctcc ttggctgtgc tgtgctggcc 106500 

aggcagggga gcacaacagt ggcccaggtg cttccaggcg ctcaggcaga ggttggcctc 106560 

taaggagagc cctagcctca atgttattaa acaaagagta cagcaaagaa tacaaaggta 106620 

aaggagcgta gggctgctgt aatgttatag aagggcacgt atgggcaatt cttttcattg 106680 

agaggcagtt tcatctggcc tcttatataa actcttcagc aaatgttact agaattgatg 106740 

aggttcaata atccctaata tttttgacaa tattctcatc aaatatttta aataagctgt 106800 

tctcagaata ccaaagtaga tgcagaaata tttgtgtttg tttggtacta tccactgtat 106860 

ataaattgtc atggcatttt tttttttgca atctctttca ccagctgacc aatctgctat 106920 

gtagtgaaat tgctttattg ttctgtatga gacacgaaaa tatttgtaca gaaggggatg 106980 

tgtcaggtgg aaccaaataa aggagcactg aagaggaaat actagagaaa caaatgttaa 107040 

aataggaaga tgttgatagg atgcaccttg ggaaactttc tatttttttg taaaataata 107100 

gtcttgatta aaatgaacga tggaaagaag ttgcattctc atcacaggca ttttattctc 107160 

tccctctctt ttcagattga gaagacaatt aactttgaaa aactcacaga gtggaccaat 107220 

cccaatacca tggagaagag gagagtgaaa gtgtacctgc cccaaatgaa gattgaggaa 107280 

aaatataacc tcacatctgt cttaatggca ttgggaatga ctgacctgtt catcccttca 107340 

gccaatctga ctggcatttc ttcagcagag agcttgaaga tatcccaggc tgtgcacggg 107400 

gccttcatgg aactcagtga agatggcatt gagatggcag gctccacagg ggtgatagaa 107460 

gacatcaagc atttccctga gttagaacag tttagggctg accacccatt cctcttcctg 107520 

atcaaacaca acccaaccaa caccattgtc tactttggca gatattggtc cccttaaaga 107580 
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gagaaagagc tggcaataac acataccttc ccctcagaaa caaaatcccc ttaccgtagt 107640 

attatagcat aatcttatct ctttcataga aaagacatac ccgcaggaga ggagacagca 107700 

cgaagcacac ttactccttc ccttcttgta ttaatttcag aatggcttga tatgagcaaa 107760 

gactgagcca atgagatggt gagaatgaag acacctatca gccattaagg tgataagtga 107820 

ttttcaccca aggaataaat agtaagaatg accctaagtc cttgggagcc cgttacatag 107880 

aaagcaataa gctttgctca tcccattccc tggtaacata ctgctgacaa acccacgtta 107940 

ccattcctga aacatgggct ttgagatctc cagtctagag gggatgtttg tggaagagtt 108000 

tctggtgtgc agattattga tttgtgatta tgtcaatttt atttttcttt atttggtaat 108060 

tgggcaatgg tatacatgtt cactatcagt ggagttgtcc tctcaccata agtcctctca 108120 

cctagttctg aatttcttgc agaggttttt caaagtcctg aagagtctcc cttccattcc 108180 

agagaaggga aatagatcca gttttgcata ggtgcagtta tgccttttct cagagtgcag 108240 

attcaaagcc tgaaccatag agatccagat gattcttatg acccagaact cagtgagatc 108300 

cactgggcga aagaactgtc taaatttttg tttaaaactt tggaagacac ttcaaatttg 108360 

agaacacctt tttggtgaaa aatcctgaaa gtgttgtaaa atacttcttc taagaaacaa 108420 

attagaatcc tatttttttt ctgtcttctc ttcctacgta tagattgtca actgcagatg 108480 

tggatcctct ggctcaatat tatattctgt tttcattcta atcacccatt attgatttag 108540 

atacatacag ttgattttgt tttggttaac ataataaaag aaaaccacaa acagttttca 108600 

tgtaaattat attagctttc tgaaccacac actcttaaaa atatctttac attttaacaa 108660 

ctgtgagtaa aacgtgattt agcagaaaaa tgtattctta gaatgaataa agcatgcaga 108720 

tatgaagttt ttcaggcatt tatgacattt ttaggagtac ctgttttcaa gaagaacttc 108780 

accaaagacc tacaaccaga gtttgattct tctctgtatt tcagatgaca agaagtaccg 108840 

caatagaata caaagttatt cctatctatt tttctgtgcc attccaacag gcattaaaga 108900 

tgacctggca attttttctg gtaaatattt caaggaacaa ctattctaac agttttaccc 108960 

ttttatacag aatcacagat tcataggagc tggaagagac ctcacgggat catctagtcc 109020 

aaccccatat gatttcatca ttttatatga tgaaataatc tggaattcat ataacttgaa 109080 

aggcataaga aaggttaaat agatcaacag ctactaggca aagcatttcc ctatgcagag 109140 

tcttgagagg aggaactctg atgttaacat cgcctatttc cacattagtg ttaccactgc 109200 

agtgtcaatg ataaaaggtg atctgtagag taaaaacagc tggtgctaca ggtatgacac 109260 

ccacattttt tgtagattat caggatactc acaatacaga cacagctgtt tttcaatggt 109320 

aaaaccaaac attttaccaa gtatacttta ttttttgcct ttagaaatgg aagtagtgag 109380 

aagaacagtt ccaaggtaag agaaatatca gcatctcaag gtttaccatc agtagtttat 109440 

tcatctttca catctcttat gtccatagaa tcatagaatc atagaggttg gaaaagacct 109500 

taaagatcat caagtccaac catatcctaa ccatactacc ccaactttaa caaccctctg 109560 

ctaaatcaga agcagttgat ccactctgct aaatttaaaa gccagttcac ttaaacaata 109620 
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agaaaactag aggaagatta catttgcaag ctactcttct acgattaata gactggaaag 109680 

tgcataaagt acaaagatat gcctcaatga tctgaaaaac agctgaggtc aaaatggaga 109740 

aatggggaaa aaaattgaag gtttcttgct tgcaagcaaa tataaagctt cccctttctc 109800 

aaaaagaaaa acaagagaca ggaaagagtg gaaattcagc aatactgaac aaaaattgca 109860 

acaaaatact gatggccaag ccctggccac cactgaccag gcagggggca gaacatacaa 109920 

agggcaggat aaaagtgttc tccatgaagg gggtggcagg cctggtgggt gggatgatga 109980 

acaaaataaa tacactttta aaaccttcct gtggctccag gcatttttgc cccagcctac 110040 

caaaggctat tagcattttt attttctgga gtattaagac cctgtttctt tgacagacta 110100 

ctgtgcagca actgacagaa ggtttagtgg gtaagtcaca ggggataaaa tgttcagcat 110160 

agaccaaagc aaaaacataa tgtcatgatg ggtgatcaac tggctaacag gttgggctca 110220 

aaggattaca gttactgggg ttacatcagg ctggtaggca gtcactaatg gggttctgca 110280 

gggctcaatt ttagggctag ttctcttcag tgttttcatc aatgacttgg ataaaggact 110340 

tgaagtcata ctaagcaagt tcatggatga cacaaattgg gaagtgccat tgactccctt 110400 

gagggtaaag aggccttaca gagagattct gaccaatcag agagcttggc aatcatcaac 110460 

tacataaagt ttaacaagag caggtgtcat attctgcacc tgggatgggg cagccttggc 110520 

tgtgtgtaca gactggagga caagaggctg agagcagtcc tgcccaaagg gacctggggg 110580 

ttctggctta cagcaagctg tatctgagcc agcagtgtgc cctggcagct ccaagggcca 110640 

accgtaccct ggggtgcacc aggcccagca ctgccactgg gtgagaggag gggctgtccc 110700 

actgtgctct gtgctatgca gctgcacctc cagcactgca tacagggttg gctgccacaa 110760 

cgtaagaaga acacaaaact attagagagc atccaaagaa gggctatgaa gatggtgaag 110820 

ggtgtggagg gcaagatgtg tgaggatcag ttgaggtccc tgggtttgct cagcccagag 110880 

cagaggagct gaggggaggc ctcatgacgg ctgcagctcc tcacaagggg agtggaggga 110940 

cagtgctgag ctctgctctc tgtgacagca tggggctgtg tcaggggagg gtcaggttag 111000 

gggttaggaa gagggtgatg aggccctgga acaggctccc cagggcagtg ggcatggccc 111060 

caagctgccc gagttcaagg aacatttgga aaatgctctc agatgtaggg cttggatttt 111120 

gggtggtgct gtgtggagcc aggacttgga cttgatgatc cttatgggtc ccttccaact 111180 

caggatattc tgtgattcta tgacaagatg cactactgtt ctatgtgtga gatactactg 111240 

ttctgtgtga gatactagta gccaaggcct tcacagggcc tttctgaatg tgcctccagt 111300 

gaatggtcac cggagtaatc ccctctgtca acactgagat acacatctct gtcaccatct 111360 

gtgacaggct aaggcagcag tgcaggcaac aatgtcaatc tcttcagaat ggcacagcac 111420 

tgctgcagaa aggggtctgg tacgctgtga gcttctgtct gaaaaacctt gaccaaacac 111480 

tggtattctt tggactaagg aagcaacata attccataga acaactgagt gggaaatcac 111540 

cactgatagc tattgcatca agttctgcaa cagcaactaa gaaatcactg gcaatcactg 111600 

tgggcaagac aaacactaaa tggtcaataa gctccctgct actagaaaaa cagtggaaac 111660 
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ataaatgaga acaaaatctc tagttgtgca aaggtttttg tggtagaagg agagactttg 111720 

ttctatgagt tgacctggac ttcatattct tttggaaagg atcagatgtc aaagagtctg 111780 

ttagtttagg gacaggccca cagtgaaata cctggtaaag caaatagcag ctaagtctta 111840 

gctgacctct ttcatgggat aagcattgtc aaaaaaagga catgtcacaa caacacatcc 111900 

ttttgtgttt gtgcagcatt acacagctgc gctggacagc gtaatccatc agctatccca 111960 

gaaagcattt cacatacagg aaggtttgct taattttgct agacttcaca acagatctaa 112020 

acttgataag tagactacac aagaagtcca atattctgat gattctgcac tgatgactga 112080 

ccacgatcca atttttcact tactgtggac tgatttttaa attctgcaac tcgctttaga 112140 

ttaaccattc atttgacaaa aagaaaaaat ccaacagcaa cagttggctg ggatgtatat 112200 

taactttttc tggaggaatc tgctgcgcct tctctatcac aaaacaaaaa tattcctctc 112260 

agcactgagt atatttaacg cagagatatt ttgaaagcca tataattact aacaacatta 112320 

gtgctctgaa ttagctatta tgacacaact gtagtatctt tgtagatcct gagttgtagg 112380 

ctgtctatga tggcccaaac atatgattca ggcagatggt acacaaatgc ccagggagct 112440 

ctcctatagc aagctgtagc atgttgctag acagtttgat agtaaaagga tttaagacat 112500 

aatcaaaagg tagaggagac acagtacaac ttgttggata atcctttgac ttttgagctg 112 560 

tcaagtatac aaccacggac cattgcagtg agtattaaag cctgtttgaa cagaaaacat 112620 

gctgattgct agccttaagc aagaaaggga gaaggggcag cagccacaga aacatcttgc 112680 

agtgtgagga gtgctctaaa ttgtgtgatt aaagatattc accatgaaca gacacattca 112740 

gtcacttgat atgtcttcca ccagcacaga taccaaaatg gaactcacga cagtggtgag 112800 

taatttacat attgttgaag caagagaata gctcactccc tttataatag gtttgatgtg 112860 

atgggctacc aataagagtt aaggcctaat gatctttact caaaagtatt gctgctgcat 112920 

agcaatgtct gcaccagact ggactgggct atagatggta tcatgtaaca tactagttgt 112980 

aattaagtgt atcagacaga ctgaggtctt cattattagt attgctctag catcttcagc 113040 

tgaacaagac taatgaggac tctattaggc agaaaggtat ggactattca gaggctgttc 113100 

actttcacag acaactaaaa gggttaagga gtccacctct tttctccaga aaacataatt 113160 

tgttctagac aatttcagag gcattttgta tattgacttt ggagttctgt tttaaaatca 113220 

gagcatactc agaggtcaaa gtagtttgtt tgttgcccat tcttttattt caaaggattt 113280 

atgagattgc tttatgcttg ctattgtata ttatgactgt cctgcagacc atgaatgttt 113340 

cacctgatgt ggcatgaagt tacttgtgaa cgatctgtaa gaatgttctt tgaatgtgca 113400 

aaagacacat tttgaacctc acatctggtg ctgtgacctg tttgaaaaga acaactcaaa 113460 

tcaacattca aaactagcag tgagttcgaa tacttctctt gtagcttctg actggagtct 113520 

gaatatccta atatctgaat ttaaaaagca acagaagtct cttctctgct caacctcttc 113580 

tgcgacagta catctttctt cagttctgta tttttttttt cttttaatac agatgctctg 113640 

aatattgctt tcaaaattaa tttggattca tacagtatgc ttgttgatac tttcctactg 113700 
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acaatctgca cagaccatgt tggcacacaa ggtccctgag ttagactgct ccagcaatgc 113760 

tagactgctc tgcaaaatgc tttatttttt gcaattcagg ctgtaagtgg catcaggcac 113820 

aagaactaga caattacata caagttttca ctgtaggtat ccctattatt tgcagaggat 113880 

ttggactaga tggtcttcaa gggtcccttc caaatcaaat gatgctttga ttctgtgatt 113940 

ttatgaaaag ttgcagtaag tacagggtgg gcataacaca gcaaggagtc ctgaatgtac 114000 

tgcatttttt atgttctcag aatggtgact gctagaggaa tctggactgt cagtactcat 114060 

agaggaaaaa aaaaaaaaaa aggaaattga cttaaattcc ttagagacat tgtgtacaac 114120 

taaatatcac actttttttt ttttgctttg ttttcactat ctgtgccaca gtatttggtt 114180 

ctgtgcttga attatactta gtgttcaagt ttcagtgaat agcttttatc atttttgttt 114240 

caatcttatc agtatactcc atccttttct ccaaggtgcc atatgatatc cttccttctg 114300 

gaacttttat ttagagactt ctttctttct ttccctcttc cattctctct ttctttaact 114360 

ttttcctttc tcctttcttt tttgttttct tttttttccc ttttatcttt ctttccttct 114420 

tttttcctta tttctttttc cttctttctt ccttttttcc ttcaatttct tttttccttc 114480 

catttctttc ttttttcctt ccatttcttt tcttccttcc ttccttcctt cctttctttc 114540 

tttctctctt tctttcttcc tttctctctc tctttctgtc tttctttctt tccctctttt 114600 

ttttttttaa ttttaatttt tatttttttt ttgtaaataa aggacttcaa ccaagtaaaa 114660 

gtgtgtttct gacactgagt tccatccatc attcagtttg gcaaacacag aataggcagc 114720 

atggggtgtg tcatgacatt atacaggata tatttcaagg agttctgcaa ggctgtacca 114780 

cgtacagctg agaagctgta ctcttatcat cacaggtgaa gctgataagg taagcatttc 114840 

ttttggttat gattcatgtt ctaacccatt ttttaaaatg atcataagac ttacaagaat 114900 

actgatggaa ctttgtggtt tgtcatcaag aacagtcaag aaacaaatga ttaaaggatg 114960 

acttctttaa aaatctattc ttaccttcac atttctgttc tgcattactg tactgtttca 115020 

cagcctgcca catatgaagt caaagtgtta gtacaaagta aagctatgtt tactaattct 115080 

gtaacactga gaagctggca ctgtactgag acaccctttc ttctttttca ttgatgccct 115140 

ttgtttctga tttagaaatt aaatgcagca ctgaatttgt ttaaattcaa gacttaagct 115200 

gagttgcatg gtctacctaa catactttct gaatgaagtt actgaatgca gcatggtcag 115260 

gtatcaacaa catactgcaa attaatttct gtgtattcta aaacaagcaa acgaacaaac 115320 

aaaaaacaca cacacacatg cacaaagcat ttgcttcaac agtatgtttt ttcaacaaga 115380 

tcatacatgg agcttaaagc ttaaaatata atactctgtg ggagtagtaa ataatccaga 115440 

agtttgccct ctatcacctg cacatgtgat tcaattaaga gagagatgga acacatgaat 115500 

gtgttgattc cacacaatga aacatttggc agaatatctt ggatttcccc tgtacttggg 115560 

aaattctacc ctaggaagat tctctctgct tgtgacaaaa tgggaagata taaggacctt 115620 

aatactgcac tttacagcac tgttgtctat tctatgttgt cttctttact aaagagtttt 115680 

ttttttcctt tactgttaga taaaatgata tgtgttgaaa ctacagggaa aatttcatta 115740 
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gaatgtcaga aaaaaaagac agaaaaaatg tttaaatact gacgatgtga agtatctgca 115800 

aatgaaacaa gcctaaacaa tcactgcctt attaaaaggt ggattttatg aaaaaggtgc 115860 

caataaaatt aaagaacaat tttgaaaagt gaggtataat taagtcaacc aagaatggaa 115920 

catgtaatat ttaacagaca tttgtcataa agcagatgag tttggtaaat cattatctct 115980 

ttctatcact gtgcttccat ttccctaatc tatttttaag aaggtaatga tgaggtttga 116040 

gacctctgat aaagtggttg gtataagaat ccagcttcca tttacatgaa ggtggagtaa 116100 

atccagaaaa aaacttgcgg tgtttttcca gacctaccca ctttatattg tcaataactg 116160 

tagtttggat cacagagggc tgatctgtta actggtctta aaagtgatgt taaaaactat 116220 

agtgaaaaac ctggtctgga gtctcaggtg aatgaagact gagaacaaac ctatgtgtgt 116280 

tttctttcct gcacaagatg ggaaacgatt gtcaatgagc ttctttcaag gcaagtcttt 116340 

gcaatatttt caacacagta cacatgtaca gaggataact caagtttcaa ataaaacagt 116400 

tgccagccta cacataactg gtacctatac aagattttga ttgctcacaa atccaagcac 116460 

acacctgcct tttaaatcca cactactgaa ttctacttac tgaaaataag ctgtgcactg 116520 

tgtacagagg ttcaagtgca ctgacttcct tggaatacaa ctaatacatt ttaatctttt 116580 

ctttagacaa cgatttcaga atggattcca tcagtgtaac aaatgcaaaa ttttgttttg 116640 

atgttttcaa tgagatgaaa gtccatcatg tcaatgagaa catcttgtat tgccctctga 116700 

gcatccttac agccctggcc atggtctatc tgggggcaag aggtaacact gaatctcaga 116760 

tgaagaaggt aagttgctta cattggtgta aagtggacag tggactctac ttctgcttgt 116820 

cattccttct aagtaataac atattatcta ctcatgaggc tctcacatat tttaattcac 116880 

cagatggatc atgaatcagg gaattgtatt atttttttct aaattctgac atcttccaca 116940 

taatgtgatc attttttttc catatttttt atttttgtat taaaaagata aaaccctgga 117000 

ggaaaggaag agggagaaca ttattcgcag tgcataatac acaactaagt taacatccag 117060 

atgctcactg aaaaaaatat aatctaagca aatagtgcta tttccaattt ctcagaaggt 117120 

gacatgaagt atgaaccagc tgcaagctta cttgcagcct tttagttcat ctaatctagc 117180 

atttgttgtg ggtttttttt ttgtttctgt ttttgagcca acagctctac cccgaacatc 117240 

acgtgtaaat tttaaatgca taccattttt ggtcacgctt gtgttttttt ctcactggca 117300 

ttttctcttg caggttcttc attttgatag cattacagga gctggaagca ccactgactc 117360 

tcaggtaaag atgtaacttc tctccttttg ttcctatttt ctcctcagga caaaactaga 117420 

actactctgc ctctgctcca agcagtttca gactgtcaaa agtggtggca atgctctcaa 117480 

accaaacaga tctgtggagg gaggaaaaga gtgtgtaact cactcttgtt taaagccagg 117540 

gaaactgact tggagatagg tttatttgtc tgtttaatgc accatcatca gactaggtct 117600 

gtgggtgaat tccaccatgg cctgactgtt agtgatgggg acagtccttt ggggtctgat 117660 

tttctagata aggagaaact aatgtgacat atcatcttgt tttcctgtca tcacctcagt 117720 

gtggctcttc tgaatacgtc cacaatttgt tcaaggagtt actctcagaa atcaccaggc 117780 
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caaatgctac atactcactc gagattgctg acaaactcta tgttgacaaa acattctcag 117840 

ttcttccggt gagttgaagt gtgacttaac ctcagtgaga ttgcccactg ggctcacctg 117900 

ggactcggct ctactgtgag ccacaatggg aattggtttg agccacagga tgagttcaaa 117960 

cctttctgtg gcttttagga ggaggctagg ctcacacaag gtataagggc tctggagata 118020 

ttcaagaccc atttggacac tttcctgtgc aatctataat gaacccctgc agggggttca 118080 

aattgatgat ctccagagat cccttccagc ccctgcgatt ttgtgactct gtaatatatg 118140 

cccatgcagc aactgctaca gggagcaatc aaaattgctg ctcattcact aaaaaatgtc 118200 

tcttaatgaa aaaggtgatt tgtaagggag gaaaatgact tgaaagcgtg acgactgaaa 118260 

ttgacaaaaa tattttgttc atcttttcta aaactagaca taaaataact cacttaaaga 118320 

aagttttggt tttgaaataa aaaacaggaa tgtaagaata cacagttcaa aagaaaaggt 118380 

aggcacgaag atgaggaaat gagtattgtc tgtccttaat aatgtttgca gaacagaagg 118440 

ttttatggta aaatgaagaa aatatttcaa aattttaact tagaatccaa tctgaagaca 118500 

aaagtgacaa atctaaatat gtgaagtagc cttgtccagc tttaagattc agttacagca 118560 

agagagctgt ttgacttgtt caagtgtagg gatagaagtt tcttttaacc atcactttcc 118620 

atttcattaa ttttgcattt catattcttc tattttaaag ttctcaacag tcaaacacaa 118680 

ttcttctgct tataggaata cttaagttgt gcaaggaagt tctatacagg aggagtggaa 118740 

gaagttaact tcaaaacagc tgcagaagaa gcaaggcagc tcataaactc ctgggtggaa 118800 

aaagagacaa atggtaagaa gtaaaaaaat agctgatatt ttctcctacc tactgtaatc 118860 

tacgctcttg tcttcttctc ctcaaaatgt gaagaaaggc atatcaagga acagcacttg 118920 

attattgcta tgaaagcaaa ctcccataaa actcaccatg cccttcattg caggcattca 118980 

tgcaaaccag acaggctgtg tcttaacact cactgctttg cttccttttc acaggacaga 119040 

tcaaagattt gcttgtatca agctccattg attttggtac aacaatggtc tttattaaca 119100 

ccatttactt caaagggata tggaaaattg catttaatac agaagacact cgggaaatgc 119160 

ccttcagcat gacaaaggta gggacatggg cactactact ggaaaaattc aagataaagt 119220 

gatccctact cacattgtct catgcttctg ttttgcagga agaaagcaaa cctgtgcaaa 119280 

tgatgtgtat gaacaatagc tttaatgtgg ccacactgcc tgcagagaaa atgaagatcc 119340 

tggagctccc atatgccagc ggagatctga gcatgttggt gctgttgcct gatgaggttt 119400 

ctggcctgga gcgggtacgg ccctggcagg ggaagccaac tagttcggag ttcagtggga 119460 

acttctgact gctttcagac ctttggctgt cctctcaccc cctggctgtg ctgtgctggc 119520 

caggcagggg agcacaacag tggcccaggt gcttctaggt gctcaggcag aggttggcct 119580 

ctaaggagag ccctagcctc aatgttatta aacaaagagt gtagctaaca aaacaaaggt 119640 

aaaggagcct agggctgctg tagtcctgca gcaggggatg ttggtatatg caagttatct 119700 

ccatcaagta ctagagacag atatgctagc aggatttctt tttttacttt gaagaaattt 119760 

caattcccag agatcaagta gagttcaaac actgttacca agtcataggg accaattctg 119820 
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ttgatgaccg ttaatagatt tttttcatga gtcacccctc caaataatta aatataattt 119880 

tttttttgta aatatgaggg atattttaaa tgatcatttc tcattgaatg tagaaaaaaa 119940 

taggaaaaat ataacaagaa aacaaacagc atttctgaga ggttagctgc aaacatctgc 120000 

aaatgagcaa aaatttgatt tgacataatc aaaaactgat ttttcagaaa agcatttgat 120060 

ctgttggaag aattttcaga tgacaaagtt ttggagagct tcatcaagac agatgatatg 120120 

taggctatag tcaggaagaa gcacaaggga taaacaaata gatttaagct taagcgtcac 120180 

ttctgttttg cacacaaata aatgaaataa atagcaacaa gtggtattaa tacagttggt 120240 

atggccacca tactcctgct ttatgcattt cattgtctct tctctttgca gattgagaag 120300 

acaattaact ttgacaaact cagagagtgg actagtacca atgcaatggc aaagaagagc 120360 

atgaaagtgt acctgccccg catgaagatc gaggaaaaat ataacctcac atctatatta 120420 

atggccttgg gaatgactga cctgttcagc cgttcagcca atctgactgg catctcttca 120480 

gtagataacc tgatgatatc tgatgctgtc catggggtgt tcatggaagt caatgaagag 120540 

ggcactgagg cgacaggttc aacaggggca attggaaaca tcaagcattc ccttgagtta 120600 

gaagagttta gggctgacca tccattcctc ttcttcatca gatacaaccc aaccaatgct 120660 

attctattct ttggtagata ttggtcgccc taaagagaga aagagctgga aataatgctt 120720 

accttcccct cagaaatcaa acctctttac tgtagtattg tagcataatc tcaatgcaat 120780 

attttatcca agtggaaagc cttcaatatc tagggagaca ttcttgaaga agcatgtgaa 120840 

atttcagatc tttatatgca ggaatttatt ctcagcttag attcaggatt catatccaag 120900 

gtgtacatat tcccaatgtg cttgaataac ttgggaaaca gggccagtgc tttggggttt 120960 

tttttgtttg tttgtttttt gtttttttgg tttggtttgt ttttttctgg ttggttgttt 121020 

tttttttttg tgtgtgtgtg tgagattctg ccattgttat tgagaatctg gtttctctat 121080 

aggagttctc tgaaataaac acagctttca ggaaaatcct ggtcctttcc attgaattag 121140 

ctgggcagtc atcctagaac tgatgcctgg acaacttgca gatgaaattt ttaacttcag 121200 

cagaccattt gtcttccagt aatccatttg gacttattcg tgctgcgtaa cattttttct 121260 

gagggagcat acagaaagtc taccatttct tcttaaatca tctccaaaca aaacatcttc 121320 

ctgattgata ttatttccca ttttcatccc agtgacatgt cactgatttt gtgaatgtta 121380 

attaatggtc tttctattta ttctaataaa agcttcgcaa acaaaacatg tcattaccta 121440 

ttctgggtta ctgtactaca caacctgaaa aatacgatat agcgggtaat aattattgac 121500 

agaggtgact aagctggtat gtggatccta ttttcaaaat cagaatgtac ccatatatga 121560 

ggtcactaaa tattttaaga ttaaaaaaaa aaaaaacaac tgggtttaat caaggtaaac 121620 

cctatagctc ctactcttca attgagcttc tcccaataca gcataccaaa taacaaaatt 121680 

ttttgaattt actgaatttc cagagaactt ttacagaaat cctctaaggg tcctcagtaa 121740 

atacatgaag gtgatgtgta caagatagaa ttttaaaata tgagaaaggt attaaaaggt 121800 

agactgcttc agctttctca tgctgacaag aatcacatga agaaatcttt ctattgcctc 121860 
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atgtgatatt cctctcgaga tgttgtatgc tatttcacgg tctttagagg aaagggtctt 121920 

taggttatat attatccaat tataatggtt actagtgtta atgacagttt tctgctagga 121980 

tacacatgca gaattgcaaa ttacagtatt ggtactgaaa atgtagcgat acttcagtaa 122040 

ttcagagctg cctcaaacac atgccatgtc agcattaact ataacttgaa atgatgacat 122100 

tagcagcagt gaaacacgtt tcatacccac taaaaatggg agaaatgcga ttactggtct 122160 

tcccaagagg gtcagaagga ttaggtacag tttgcacaag gattcaagta aaaaaaagta 122220 

trtcaagagc tatgaaattt caagttttac tgtgtacatc atgttatttc cctcttacag 122280 

ctgaaatcag tcaatacagt tttgctacaa ctaaaaccaa ccaaccaaac aaacggcaaa 122340 

ggaaaaaagg caggactggt aaaatattgt cctgagcagc agagtgtgta aggttactaa 122400 

gcagttataa tactgtgttg gaactgaaat atgcagctat gtcactctgc actttctagg 122460 

tacaataaca gtaaaagtac ccacatttac tatggaggta tttaatatat agcgtaaact 122520 

aaaaaaacag ctattcaatt gtttgatctt tttaaagcaa aagatgaaga aaatcaaggc 122580 

agaaatgata aatgaatttc aaaaactata caaaagaggg tattcaaggc acagagtcca 122640 

cctaatgtct taactgtgaa atggaagcat ttgacttgtt ttaaaaaggc cattaggtgt 122700 

cactaagggt aaaaagttac gtttatcagc tttcagaaag aggatggcat tccaaaggtg 122760 

cctctgagct ctgaagccca gcaagggata agagaaatga atcccaagtc cagctattgt 122820 

ccaaaagtcc tttttgttcc ctgataccat gactgaagtt gtcatgtcaa gacattgcct 122880 

tctgtctgtt cactacacct catgttctct cagtgctgtg ttcttagaga ggcagtactg 122940 

ctagtggtcc gcggaatgaa aacagccagg tgtaatcaca ctcttttgaa tgcctcatga 123000 

gaagtgctct ctggactaag tgtaactttc cttctcacat catttggaga agggaccatc 123060 

acagaatcat ttggagacca cctgcatcta ttctgcactt ctctccacat ttgcctatcg 123120 

tcttctaagc aaaccgaatc tatggctgaa gttacaagac tctgcatgtg gtgtcacagt 123180 

caccagaggc agaggactca agagaatatc ctggtccaga gctactctgg gatctcaaaa 123240 

gtcacttggg aaagcacagc atgttgaact agactttggt gtatttttcc aatctttatc 123300 

agtctacaaa atatcaactg gacatggagc agtggttcac tttggtgttt cctgtggctc 123360 

atgatctaag attcttcagt ctggaaaata aactgcagag attactgttg aggagcagca 123420 

agtgtcagtc tgctcagcaa gggaaggaga ctgagggcaa gggaaggaga ccgtgccagt 123480 

aaggctagaa gggcccgata agggatcaag tgctaaattt tccaatatta tagccatagt 123540 

ttagtgattt ccagagaaat atgtagtgat gtggtaggcc aaagatcaaa ccagaaggcc 123600 

ccagaatggc aacaggtgat gatccctgtg gcacattctc tatcttgaag taaaacagca 123660 

tggatccata taaatacatt cttgctcaac agcagaaata acaaacagta ttgcttactt 123720 

ctacgaatat cctaacaaaa catgtagatc acaatgccac tgaacctttg tatggatgga 123780 

atctgtgcaa tctgccatga ctaaagctct gtccaaaact gcacaactta gggtgcccag 123840 

cttctgaagg gatgtgaaat tatctgtgct atctcctttt cccttcttgt gttagctcca 123900 
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gtaaactcta ttttaagaaa taccttacag tttctgattg tcttctttac tggtatccaa 123960 

agggactcct atgcattaca gggtcctcca gcacagtgag gttcttggcc tggtgcaggc 124020 

atgcaaagta gcttaggcac gggtcacaat caagatactc agtttaatgc ttctcccaag 124080 

tgatgggatg ctaaaatctt acatgatttt aaaaggaaag tgttcaaact gtggaagaga 124140 

aatccactga caaataagaa agatacagaa aataaagtta gctatagaag acatggaaca 124200 

ggaaataatg ttagaactct gagggcaaga gtaagcctta acagtaatga caagcctcac 124260 

tggaggagct cttccacata cgttgttctc atgggcccag gagtctgtac tggaaattgg 124320 

cacacagttt gggtaccggg ggcgatcttt gtgagatgaa gccctgaact gccctgggtc 124380 

agctgcaggt gtctctgtaa tggatgaaaa caactcactg tgcaccagat tttcagctaa 124440 

taagaaaagc acatggcatc tctgctcaaa cagaatcata gaattgctca ggttggaaaa 124500 

gaccrtaaag gtcatcgagt ccaactgcaa cctaaccaac taccctaact ctaacaaccc 124560 

tcctctaaat catgtccctg agcaccacat ccaaacagtt tttaaacaca tccagggatg 124620 

gtgaatcaat cacatccctg gggagcctat tccagtgctt aacaacactt tctgtaaaga 124680 

agtttttcct gatatccaac ataaacttac cctggcacaa cttaaggcca aatttaatta 124740 

gaaaatgtag cagcactgca atgtagcaaa tgtaattacg aaaaggtggt agctgctagg 124800 

gacagaggac atgcaaatag acccaaaaga taaagactag aaacagaaaa aggggacatg 124860 

tgagaggtat gtttggagaa acataacaga ggagatattt gaaaggagat cttgggagca 124920 

caggcaaaga cacaatcctg ggaggaggtg ctccatgcta gaggatgtac ctctaaggca 124980 

ccgcagccat gggcaaccaa cacaggtcag cgtcatcctg gtgagactgt atcccacaag 125040 

cagctaacac tggagtaggg acagccccga agaactgcag cccaggcagc acactagagc 125100 

agagaaatct agttagcagc aaccactggc agacagaaat gattatatag attacatact 125160 

gaccctagcc tcttacactg cctactgcat cactgaaagg actgggaaga agagagtgca 125220 

ataacgtagc tgaaactagg aggaaggcaa ggagaactga agctgactag ggaaaagggg 125280 

gattaaaggt ttaagtgtct attccatagt ttgctggttt gttttttgtc aattcctgaa 125340 

tcagtaattt ttatgttaat tagcaaaaaa ttacaaacac tccccaagtc aggactgtta 125400 

cctacaacag aagctcagat cagctgagcc ttagtctttt ggtccctccc tagggaatgc 12 5460 

tgtatgtgtc tctctctcca ggcctgctca aaattgacct cagacccaaa cttttgctga 125520 

atctccagta ccacctcttt tgctcctaac tagataacaa agccctgagc gctttgcttt 125580 

tagcaaagcc tttaagtgcc attaccaact gcacctggag cctttaccta cccctatgga 125640 

cccaggctct atatttaagc tctgccctga accttcactt ctttcctgtc ctaagttaga 125700 

tgtactagta tggtgtgtac tatgtctcca gttcaaacac agctgtgccc atacctggcc 125760 

aaggactcct agtatgacct gggctgtgcc ttgctgctaa ggacctgctg ggtgattgct 125820 

ggacctgatc ctaatcctga attaagaaat gatttcttgg cttgactgga tgtgccctgt 125880 

ggtatgatac tgccttatga tttggactct tgtttgcagc tgtgcaaatc cctaaggagc 125940 
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ccagtctctg gccacctgga atcttgtcac taccaaactt cctgagggac tggtcttgct 126000 

ctgggttctg atctctggac agtactcacc ctttactcag cccaggctcc cagttaagcc 126060 

cctttccacc ctgccaggct ctccgctcca tccctagcag gggctctcat gacagtgtga 126120 

ccccccctta ctcaggtcag ggccacttgt gccacgttcc tttcctgtct tctgtccctg 126180 

ccttggctct aaagcagtgt gctaccatcc acaaccactg catctctcta aagtaagcct 126240 

ctcctgagcc caagtctctg taacgaggaa ggatgcactt tgctcagaag gatgcgaggc 126300 

tgcttctgag ctctgagggc actgacctcc catgaggtac accccatacc caggaccaca 126360 

attcagcctg ctggaaccat caactcctgc tggagtaagg ccatagcaag accagcatcc 126420 

acctccctgc agccctgccc tgcccagata ttgggcctgc tgatctcagg atgcagactt 126480 

gcttctcagc ttgacctaag cattgccctg tctttatgga cccacctggt tagcaagttc 126540 

agtgcagaag gaggctgttg gcatctagct aattttccac ccacattact gtctgctgac 126600 

tcattctacg tctctcccat cttgttacaa taataatttg ggagatcata ttgaaggtct 126660 

taataaagtc aaggcatgtg atattctctg ctttgccttt gtttctagaa taagccactt 126720 

catcatagaa gatgaaaatg ctgatcagca gagatctgtg cttgataaat ccatgctggc 126780 

ttttcctatc accttatatt ccttcatatg ccttgagaca cccaaggagg ccttggatca 126840 

gagctgtctg tagcagtcct aactggtata caattagttg tacaacaggt agtgatccgc 126900 

ataatagttg gcgtgagaaa gtgggcctgt gctgtgtcaa gcatagagtt tgggttccag 126960 

tcctgttctg catggcacat atgcctgagc agctgggtaa tctctgcatt ccaattggaa 127020 

ggcaggggcc tgtaggcagt tcccacttgg catgggtgat tgtaccacct gtgtcctcat 127080 

ctgtgaagca tcatgttttc attcaaatat ccttttgttt gacagtagaa atgaacagaa 127140 

ttgttttttt ttcctaagca aattctgcaa gagctctgaa gaacaaggtg tcagtgaact 127200 

tctagctcca tagataggac ttgcatcaca tgtcatgcct tgattggagg tctatccgat 127260 

actgaacaac ttgtggttcc ctgagggaat gtaagattac tgatactact ctctctttat 127320 

gttagctaca ataaatggta ggttaagcaa tagatacaga gtttgagtgc ctttcttaca 127380 

agcatcatag tgaacaaatc cactggtgat ctaccttttc aataactaca gagaattgta 127440 

atctcttgga ttctcctcct tccccgttct gaaaatgtgt tctttttttc caaatcagaa 127500 

accttcctca accaccctga ctattctttg gacattgttt tgttcttgct cctaaatagg 127560 

ctttataatt tttgtaagtg aaaggctttg catgcaggtg aggctacaac tcattcagta 127620 

acaatgagga agactgtcag attttgggga aaattctccc acccaacctt ttgctagcca 127680 

gtaagatgta atcactgaat gtcatgccac aaagaccata ccaacatcag accacatatc 127740 

tacaggaagc tttaaggaat cattgactgt acagtgaagg gtaaatcaaa ttaaaatgaa 127800 

tgtgaggtct gatacgagat atcctcatgg gaatcaagag caaagacaaa tagtttttca 127860 

cagtcttgtc atgatctgtc acagaccaag gcagcacagc aggcaacaat gttggtctct 127920 

tcagaatggc acagcaccgc tgcagaaaaa tgccaggtgg actatgaact cacatccaaa 127980 
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ggagcttgac ctgatacctg attttcttca aacaggggaa acaacacaat cccacaaaac 128040 

agctcagaga gaaaccatca ctgatggcta cagcaccaag gtatgcaatg gcaatccatt 128100 

cgacattcat ctgtgacctg agcaaaatga tttatctctc catgaatggt tgcttctttc 128160 

cctcatgaaa aggcaatttc cacactcaca atatgcaaca aagacaaaca gagaacaatt 128220 

aatgtgctcc ttcctaatgt taaaattgta gtggcaaaga ggagaacaaa atctcaagtt 128280 

ctgagtaggt tttagtgatt ggataagagg ctttgacctg tgagctcacc tggacttcat 128340 

atccttttgg ataaaaagtg cttttataac tttcaggtct ccgagtcttt attcatgaga 128400 

ctgttggttt agggacagac ccacaatgaa atgcctggca taggaaaggg cagcagagcc 128460 

ttagctgacc ttttcttggg acaagcattg tcaaacaatg tgtgacaaaa ctatttgtac 128520 

tgctttgcac agctgtgctg ggcagggcaa tccattgcca cctatcccag gtaaccttcc 128580 

aactgcaaga agattgttgc ttactctctc tagaccccca agtcaaacca actatgcagg 128640 

tatgctgaca acactatgat gacagcctgt tctgatcaag atctcatttg ttcatggaca 128700 

atttttgttg cttgcagctg gtcttccatt gggaaagagt gtagtatatc cttctcatct 128760 

gacagaaaag cagaaattct catgctccac acttaatcta cattgtttta aaccaccggc 128820 

tacttcttgg agaggaaaaa tggcttttat aagactcaca aaacaaagct ctgcaagtca 128880 

aatgcataca aaactgttct gtaggtctgg aatcaggaca ctatgtggaa gtcaaataga 128940 

gcagctttaa aaagcctttg ggatcattct catcttatat ttgcagcacg atactatgac 129000 

agtgataact gacataactg catcaatttc cttgatattt tatttgtctt aaagtacaag 129060 

acatagagat ggacgtaaag atggacatat gactcaggtc tggacaggtc cgtggtccat 129120 

gtatgataaa agagatgaag ggaaggagaa ttgagactgt ctaagaaggg cttcagggac 129180 

gttctgaagg cagatttgac tgaatcagat gtactgtcca agtctcatat gtagcaatgg 129240 

aaggctgata ttggagaaat ataaagaaat ggctgtgaac tcaaagtgac cctgaacaga 129300 

aaagggatat ggagttaaaa taatgtcaca gaactgaggt ttatatgata taccatgggc 129360 

tgcagagggt cagagtgctc caccatgggc ctctcttggg ctgcagggaa cttctgttct 129420 

acacctggaa cacctcctgc cctcctccgc actgacctca gtgtcatcag ggctgtttct 129480 

ctcacatttt ctcactcacc tctcccaact accattgtac agcagttgtt cttacatatt 129540 

gctcctcctg aggtacatct agcatcgatc actggctcag ctctggccag tggcagctcc 129600 

cttttgagga cacgggacag ctgctgggct ctgttcacag aggccactcc ggcagacctc 129660 

cactaccaca acttgtagtg taaatccact acaactttct gagctacaga aatgaaatgg 129720 

agaccctctc tgctatggga tacaaaagag gaaacgtggc gtttagctct ggctcactgg 129780 

tacacccaac cacagggtga gaagcagcct gttgttattc actactctta ggacagatta 129840 

tggtgaattg ttaataaaag catttcttca taacatccaa aggaggaaat acactaaatt 129900 

atatttttta ttaattaatt acacatgctt aattatatat ggcatggttg ctttggaaga 129960 

atcttgtcct tactgaccag atctgctgtt tgctgagaca aaatggctga caattttggc 130020 
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catggtggat accttccccc ttttctgtag cattaggaca gaagttattc tggagcctgt 130080 

ctgacaagtt agacttgata cctttaagta tttggaagtg tgcttttcat gctggatgtc 130140 

atctccagaa cctccctgtc tggtaagcag ttccctgcct tagtaagagc cgaaacggtc 130200 

tctcttttcc ttgttatctc accaggatat tacaatgtga caggactatc tgaactatgc 130260 

caacctgcaa attccaaata tatatatata tataagatat ctatacacaa attattagtg 130320 

tttgattgac accagatgac agagaagtgc atctgagaaa acctattccc aatctccttt 130380 

ctctttctgc agactgacat gcatttcata ggtagagata acatttactg ggaagcacat 130440 

ctatcatcac aaaaagcagg caagattttc agactttctt agtggctgaa atagaagcaa 130500 

aagacgtgat taaaaacaaa atgaaacaaa aaaaatcagt tgatacctgt ggtgtagaca 130560 

tccagcaaaa aaatattatt tgcactacca tcttgtctta agtcctcaga cttggcaagg 130620 

agaatgtaga tttccacagt atatatgttt tcacaaaagg aaggagagaa acaaaagaaa 130680 

atggcactga ctaaacttca gctagtggta taggaaagta attctgctta acagagattg 130740 

cagtgatctc tatgtatgtc ctgaagaatt atgttgtact tttttccccc atttttaaat 130800 

caaacagtgc tttacagagg tcagaatggt ttctttactg tttgtcaatt ctattatttc 130860 

aatacagaac aatagcttct ataactgaaa tatatttgct attgtatatt atgattgtcc 130920 

ctcgaaccat gaacactcct ccagctgaat ttcacaattc ctctgtcatc tgccaggcca 130980 

ttaagttatt catggaagat ctttgaggaa cactgcaagt tcatatcata aacacatttg 131040 

aaattgagta ttgttttgca ttgtatggag ctatgttttg ctgtatcctc agaataaaag 131100 

tttgttataa agcattcaca cccataaaaa gatagattta aatattccaa ctataggaaa 131160 

gaaagtgtgt ctgctcttca ctctagtctc agttggctcc ttcacatgca cgcttcttta 131220 

tttctcctat tttgtcaaga aaataatagg tcaagtcttg ttctcattta tgtcctgtct 131280 

agcgtggctc agatgcacat tgtacataca agaaggatca aatgaaacag acttctggtc 131340 

tgttactaca accatagtaa taagcacact aactaataat tgctaattat gttttccatc 131400 

tccaaggttc ccacattttt ctgttttctt aaagatccca ttatctggtt gtaactgaag 131460 

ctcaatggaa catgagcaat atttcccagt cttctctccc atccaacagt cctgatggat 131520 

tagcagaaca ggcagaaaac acattgttac ccagaattaa aaactaatat ttgctctcca 131580 

ttcaatccaa aatggaccta ttgaaactaa aatctaaccc aatcccatta aatgatttct 131640 

atggtgtcaa aggtcaaact tctgaaggga acctgtgggt gggtcacaat tcagactata 131700 

tattccccag ggctcagcca gtgtctgtac atacagctag aaagctgtat tgcctttagc 131760 

agtcaagctc gaaaggtaag caactctctg gaattacctt ctctctatat tagctcttac 131820 

ttgcacctaa actttaaaaa attaacaatt attgtgctat gtgttgtatc tttaagggtg 131880 

aagtacctgc gtgatacccc ctataaaaac ttctcacctg tgtatgcatt ctgcactatt 131940 

ttattatgtg taaaagcttt gtgtttgttt tcaggaggct tattctttgt gcttaaaata 132000 

tgtttttaat ttcagaacat cttatcctgt cgttcactat ctgatatgct ttgcagtttg 132060 
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cttgattaac ttctagccct acagagtgca cagagagcaa aatcatggtg ttcagtgaat 132120 

tctggggagt tattttaatg tgaaaattct ctagaagttt aattcctgca aagtgcagct 132180 

gctgatcact acacaagata aaaatgtggg gggtgcataa acgtatattc ttacaataat 132240 

agatacatgt gaacttatat acagaaaaga aaatgagaaa aatgtgtgtg tgtatactca 132300 

cacacgtggt cagtaaaaac ttttgagggg tttaatacag aaaatccaat cctgaggccc 132360 

cagcactcag tacgcatata aagggctggg ctctgaagga cttctgactt tcacagatta 132420 

tataaatctc aggaaagcaa ctagattcat gctggctcca aaagctgtgc tttatataag 132480 

cacactggct atacaatagt tgtacagttc agctctttat aatagaaaca gacagaacaa 132540 

gtataaatct tctattggtc tatgtcatga acaagaattc attcagtggc tctgttttat 132600 

agtaaacatt gctattttat catgtctgca tttctcttct gtctgaatgt caccactaaa 132660 

atttaactcc acagaaagtt tatactacag tacacatgca tatctttgag caaagcaaac 132720 

catacctgaa agtgcaatag agcagaatat gaattacatg cgtgtctttc tcctagacta 132780 

catgacccca tataaattac attccttatc tattctgcca tcaccaaaac aaaggtaaaa 132840 

atacttttga agatctactc atagcaagta gtgtgcaaca aacagatatt tctctacatt 132900 

tatttttagg gaataaaaat aagaaataaa atagtcagca agcctctgct ttctcatata 132960 

tctgtccaaa cctaaagttt actgaaattt gctctttgaa tttccagttt tgcaagccta 133020 

tcagattgtg ttttaatcag aggtactgaa aagtatcaat gaattctagc tttcactgaa 133080 

caaaaatatg tagaggcaac tggcttctgg gacagtttgc tacccaaaag acaactgaat 133140 

gcaaatacat aaatagattt atgaatatgg ttttgaacat gcacatgaga ggtggatata 133200 

gcaacagaca cattaccaca gaattacttt aaaactactt gttaacattt aattgcctaa 133260 

aaactgctcg taatttactg ttgtagccta ccatagagta ccctgcatgg tactatgtac 133320 

agcattccat ccttacattt tcactgttct gctgtttgct ctagacaact cagagttcac 133380 

catgggctcc atcggtgcag caagcatgga attttgtttt gatgtattca aggagctcaa 133440 

agtccaccat gccaatgaga acatcttcta ctgccccatt gccatcatgt cagctctagc 133500 

catggtatac ctgggtgcaa aagacagcac caggacacaa ataaataagg tgagcctaca 133560 

gttaaagatt aaaacctttg ccctgctcaa tggagccaca gcacttaatt gtatgataat 133620 

gtcccttgga aactgcatag ctcagaggct gaaaatctga aaccagagtt atctaaaagt 133680 

gtggccacct ccaactccca gagtgttacc caaatgcact agctagaaat cttgaaactg 133740 

gattgcataa cttctttttg tcataaccat tatttcagct actattattt tcaattacag 133800 

gttgttcact ttgataaact tccaggattc ggagacagta ttgaagctca ggtacagaaa 133860 

taatttcacc tccttctcta tgtccctttc ctctgagaag caaaatacag cagatgaagc 133920 

aatctcttaa ctgttccaag ccctctctga tgagcagcta gtgctctgca tccagcagtt 133980 

gggagaacac tgttcataag aacagagaaa aagaaggaag taacagggga ttcagaacaa 134040 

acagaagata aaactcagga caaaaatacc gtgtgaatga ggaaacttgt ggatatttgt 134100 
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acgcttaagc aagacagcta gatgattctg gataaatggg tctggttgga aaagaaggaa 134160 

agcctggctg atctgctgga gctagattat tgcagcaggt aggcaggagt tccctagaga 134220 

aaagtatgag ggaattacag aagaaaaaca gcacaaaatt gtaaatattg gaaaaggacc 134280 

acatcagtgt agttactagc agtaagacag acaggatgaa aaatagtttt gtaaacagaa 134340 

gtatctaact actttactct gttcatacac tacgtaaaac ctactaagta ataaaactag 134400 

aataacaaca tctttctttc tctttgtatt cagtgtggca catctgtaaa cgttcactct 134460 

tcacttagag acatcctcaa ccaaatcacc aaaccaaatg atgtttattc gttcagcctt 134520 

gccagtagac tttatgctga agagagatac ccaatcctgc cagtaagttg ctctaaaatc 134580 

tgatctgagt gtatttccat gccaaagctc taccattctg taatgcaaaa acagtcagag 134640 

ttccacatgt ttcactaaga aaatttcttt ttctcttgtt tttacaaatg aaagagagga 134700 

caaataacat ttctctatca ccgacctgaa actctacagt cttcagagaa tgaatggctt 134760 

gctaaaagaa tgtcaaatct tactatacag ctatttcata ttacactact aaatacacta 134820 

taaggcatag catgtagtaa tacagtgtaa aatagctttt tacactacta tattattaat 134880 

atctgttaat tccagtcttg catttcacat ttgcaaaacg ttttgaaatt cgtatctgaa 134940 

agctgaatac tcttgcttta caggaatact tgcagtgtgt gaaggaactg tatagaggag 135000 

gcttggaacc tatcaacttt caaacagctg cagatcaagc cagagagctc atcaattcct 135060 

gggtagaaag tcagacaaat ggtaaggtag aacatgcttt gtacatagtg agagttggtt 135120 

caccctaata ctgagaacct ggatatagct cagccagcgt gctttgcgtt caagcttacc 135180 

agagctgttg tatgcctgtt aagcagggca tacagtcatg aggctcttga aaaatcttaa 135240 

cagacaaagg gcaatggaaa atcggagtta agggatggta gggataaaat gcatagaaag 135300 

aggtaccaca attttgattt ttgccctaat gcctctctgc gtggttcctc aatttttcta 135360 

cttcattcct catctcctca gagcattcct ttccctcatg cttgaaacac agatgaaaga 135420 

ctgtgaattc taactgagat gaaaacatcc acaaccacac aacctctggt gtggagtcac 135480 

attctgtgaa ggcaaaaact aggccacgta atctatgtgt gcaagctacg tgtaagctat 135540 

gtgtgtgaca ggacaatgtg aggaacatac tatgtgcaca aggactgcag aataaacagg 135600 

agcaaagttt ttgaagaaaa cagagtaaaa tcctgttttc ctcttttgtt acattcttta 135660 

catatatctc aaatttcctc tttggttaga agcaagtaat atttatgttt cttggtactg 135720 

tttgggttga agaccattct gggataagag aaattccagt ggttcttccc ctaatcataa 13 5780 

aatgtacagg tttagttttt ttgtaacaca gaaatctctt catcttttat cttttgttgt 135840 

gattctttat agagagagaa acaagactta ctgacaatag cagcaagaaa atcaatcttg 135900 

gaagaacaag attgcagttg caaaaacaaa ccaatgtcct tgcccctaca tcctcttccc 13 5960 

cataaattct acattctcta tctaccttgt gcttgccaac atgatatacg taaactctct 136020 

tttcgtattc attcttaaag gaattatcag aaatgtcctt cagccaagct ccgtggattc 136080 

tcaaactgca atggttctgg ttaatgccat tgtcttcaaa ggactgtggg agaaagcatt 136140 
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taaggatgaa gacacacaag caatgccttt cagagtgact gaggtatatg ggcatacctt 136200 

agagatgtaa tctagaattt atgaagagag tagacatgtt gttatatgaa cactgcatta 136260 

gcgtatctgc tcatttgtct gcatctcttt cagacactgt gttaaaagca gggaattttc 136320 

cttatgtctc tctcatcaca atattcctga cattgcaaag ctcctgagaa ataacttcag 136380 

attcccactt ttcctaggaa ggtcttcctg gatgagaaca atcaatcatc ttaactgtaa 136440 

ctagatattt ctgcatctaa gaataatctt tgttaaaact atattctctc tctctttttt 136500 

tttttttttt ggttctccag caagaaagca aacctgtgca gatgatgtac cagattggtt 136560 

tatttagagt ggcatcaatg gcttctgaga aaatgaagat cctggagctt ccatttgcca 136620 

gtgggacaat gagcatgttg gtgctgttgc ctgatgaagt ctcaggcctt gagcaggtat 136680 

ggccctagaa gttggcttca gaatattaaa aacacatgga aatttagctg ttgtaaagct 136740 

cttttcaaca cagttatcct aaaacattta accagcacaa atttcatcat gattcaatat 136800 

gtgattgttg catagaagtg tagatttgtc ccactgggtc ctgcaatagc ccatgctgag 136860 

catggcttgc tgaaagaact gctttagagg gtgaaaagtt tgacacagca gacaagatga 136920 

ttctcaccta agcagctgtt actgtagtgg cttgaactct aaaggtcttg tatctccatt 136980 

cctgtgcact gaggagcttc ttggaaagtt catataaggt ttactagttc taactattat 137040 

ctcatttggt ggcactcaat gtgctttgtt cacgtcttca taaattaatc tatctaaaaa 137100 

ttggatgtgg ttaaagcaat ttcagaaata acatgtacat aatgtacaat tattgatatg 137160 

aacagaacac aggcatagca tattgtaatt aggaggactg tagttatttt gaataggaaa 137220 

cacaatgtaa taaatgagaa ttcattgaaa tgttagtatg ctaactcaat ctaaattata 137280 

aagataaaga ggcatttaat cacagctaga tttccatcac ttgtgacaga caggcatatg 137340 

aatgattatg tacagctcta ggaaaaaaag tatgtaggaa aactagtaca ttttgattag 137400 

aaagtctgaa aatgaggtgc cttgatcaaa gagaatacgt gtgtttgaga aaaaaaaagt 137460 

ttggatagag gtggtaagag agaatatatt gaaatggtgt ttctacaaac tgccatggcc 137520 

agatttgtgt aagagacatt cagtaagtag gcaaggaaag aaatattact aggtacaaag 137580 

caacattagt aataccaaaa gaaaccaatt attccagatg ccaatctcgt aatagggtta 137640 

agagatttcc acccctctag tagtcaccag tgcaaccagt aactttgcta atttacattt 137700 

tcttttttta aatggcagat atagctttga actgagtgat catgaactgg tactgtgtaa 137760 

ataagatgga agcatacttg ggagctaaac ttctagtttt taaaaactca aattctcttg 137820 

aaagatcagt tcccagtcta gtaacagctg atagtttaag tatcagtaat tggctaccat 137880 

taacaactgg ctcctgagag gtcttaaatg tagagacagc tttaaactca aaagcacaga 137940 

gtgattttta gaatagattt cccaagcaaa gaaaataaac agggaggagc tttaagggag 138000 

tagccatctc attattatta ttatttaaag aaatggcagc aaggctataa aagaaaaata 138060 

agacagagca gagaagaaag agtcatggta tgcttttcta tcttagcaaa attaatctct 138120 

acatgcctag gaaaaagcca tgacaagagc aatcagttca aaaggtgtat gcaaaaaaac 138180 
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acataatagt aactagtact gcattgccag gaaggaagtt atgtcgccat tccatggatc 138240 

tcattctcat ttccttgcag cttgagagta taatcaactt tgaaaaactg actgaatgga 138300 

ccagttctaa tgttatggaa gagaggaaga tcaaagtgta cttacctcgc atgaagatgg 138360 

aggaaaaata caacctcaca tctgtcttaa tggctatggg cattactgac gtgtttagct 138420 

cttcagccaa tctgtctggc atctcctcag cagagagcct gaagatatct caagctgtcc 138480 

atgcagcaca tgcagaaatc aatgaagcag gcagagaggt ggtagggtca gcagaggctg 138540 

gagtggatgc tgcaagcgtc tctgaagaat ttagggctga ccatccattc ctcttctgta 138600 

tcaagcacat cgcaaccaac gccgttctct tctttggcag atgtgtttcc ccttaaaaag 138660 

aagaaagctg aaaaactctg tcccttccaa caagacccag agcactgtag tatcaggggt 138720 

aaaatgaaaa gtatgttatc tgctgcatcc agacttcata aaagctggag cttaatctag 138780 

aaaaaaaatc agaaagaaat tacactgtga gaacaggtgc aattcacttt tcctttacac 138840 

agagtaatac tggtaactca tggatgaagg cttaagggaa tgaaattgga ctcacagtac 138900 

tgagtcatca cactgaaaaa tgcaacctga tacatcagca gaaggtttat gggggaaaaa 138960 

tgcagccttc caattaagcc agatatctgt atgaccaagc tgctccagaa ttagtcactc 139020 

aaaatctctc agattaaatt atcaactgtc accaaccatt cctatgctga caaggcaatt 139080 

gcttgttctc tgtgttcctg atactacaag gctcttcctg acttcctaaa gatgcattat 139140 

aaaaatctta taattcacat ttctccctaa actttgactc aatcatggta tgttggcaaa 139200 

tatggtatat tactattcaa attgttttcc ttgtacccat atgtaatggg tcttgtgaat 139260 

gtgctctttt gttcctttaa tcataataaa aacatgttta agcaaacact tttcacttgt 139320 

agtatttgaa gtacagcaag gttgtgtagc agggaaagaa tgacatgcag aggaataagt 139380 

atggacacac aggctagcag cgactgtaga acaagtacta atgggtgaga agttgaacaa 139440 

gagtccccta cagcaactta atctaataag ctagtggtct acatcagcta aaagagcata 139500 

gtgagggatg aaattggttc tcctttctaa gcatcacctg ggacaactca tctggagcag 139560 

tgtgtccaat ctgccgctgc cctgatcctg gctggggtga tgggacagac cttggctgcc 139620 

actgagacat ctgagacact gagatctgtc tcaactcaga tttacccaag aacagatcat 139680 

tgccaacaga acaaaatctc aaacttatgg ctagtgatga cagcagtcag ttgtcccatc 139740 

tgtgacccac caaggctggc atgctggaat gagcaggctt tggtggcttg tagttactgg 139800 

acagcaccac tgacatgggc aggggaaaaa ctgagcatgg tgtaaatcac tgcctcaaag 139860 

ccacttctct gtgcctgcac catgcttgaa agctcttcta caggagctgg gtttgttcaa 139920 

gaaagcttct gtttctccca tctgcttctt gtaccttcac agggacagag ttagaagggt 139980 

acagccatgg ctggaagggg ctgactttca aatgtgccta attttccttt ggttgctgct 140040 

gcagctgcag aagaaggggt tcagaagcca agagctttga gataaggatg cctaacctat 140100 

gttgaagaca tttgtgctga cacctcaggc cccaggatag gacaactgct ggattgtggc 140160 

taacccacta gctacagaac ctaatttata ttaccagatt aggaagagca aaagaacatg 140220 
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tatttataac aggaggtctt ctgtgcttct ctactaaaag gtgctgtgaa ggagcccaca 140280 

gtgcagcagt gtatgaggcc tgaaagaggc cgcagcacac gaagagccct ggtaggagca 140340 

gcacacagag gggcaggagg gctgggggaa ctgccaccca tggggacctg tgtgaagcag 140400 

tgcactcctg aggggtggac tgcgtgggaa aggaaaagaa agcaaacaga cctgtgatga 140460 

actgtcacac agactgcaga gtgacagagg agggcttgag gcagtgcgct tactgcaggg 140520 

agtggcgctc cttcctcaca gcagcgctaa cagcttggca ccaatattca gtagtctgtg 140580 

gtgatgcttt ttccagtttc accacacagc atttcgcttg ttctacttgt tttagctttc 140640 

cccctccaca agataacaca tactttgcca gtcagtccct aagaccttag cctaacagtt 140700 

agcaaacagg atcttgcaaa agaaggaaga taacatgaca ccaccttcac tggtgtataa 140760 

atagttcaaa tactttcctt cactttcccg taaattagtt gattgcaggt caggagataa 140820 

caggggaact tactgcaaga gagaaaatga tgtttaatat tgtcttggac tttctggtgg 140880 

tctgggcatg aaaatggagt actcaaaatc ctcaggacgt ttatttttca cctgatttat 140940 

tcccaaactg cactatttct aggccattgg agttcttatc aattaaatta tactttggct 141000 

ctctgctatc tcactccctt tcatcttcag catcactttc agcacaatta caggagaaga 141060 

cttagactca gagctttagg actcatcata agaggctttc attgctctgt caccacaccc 141120 

catatagatc tgtagtatac cacacatgtg aagaagcaca gtacattagt gcattacaga 141180 

gagacaaaac cacacctatt tgtgtgcctg cagtcttaca ccagcaggaa gataattaac 141240 

gtaatgaatt tctataaaaa tgagagaata tggcccctgg gtcctactgc ttgttctagt 141300 

cctgattctt caaacgtaag aatgcaagta aaattactca cttgaacaaa gtcagcaatt 141360 

tgcaagaact gatattctga agttcaagta attagagtga tttccagtac ttctggctgg 141420 

aacgggcagc tgaaaatcac ctggtccagc accttgctca aagcaggact atcttcaaag 141480 

ccatatcaga tagctccaga ccttccctag tcaagtgttg cctatctgca tggttggaga 141540 

acccacagcc ttctgattaa tttgatttta aacataaatt caaatgtcac tagcgtagca 141600 

gtagtgaaag ccattcaact ggctttactt tctcttacca aatgagagtt agctgcaggt 141660 

gaaaataagc cctgccagtt ctcatttttt ctcccacagc ccacaaagct ctcactgtct 141720 

gtcctcactt gtaatacttt tgaaccaaca tctacagatt atctctgtaa atcccaagca 141780 

gtacctagtc accacgtgaa caacaaattc ctacatttaa caatatttaa gagcaaaggc 141840 

cagaccatat gtagctgcac actacacatt tttagaccca atagtataat ttatactttg 141900 

actccatgtt gctgccatgt ggataacaat gcgcaatcat ttgtacctgg cttccttttc 141960 

taactagtat actcttaaac gtcacaagat aaagactcta gttctgtata gtctagctga 142020 

cttgtgacaa gagcaaacac tcacaatttc atggtactcc tgaggaaaaa aaggatccca 142080 

aactaatttt gagcttttac atattttttt ttaacctaca gagcaccttg ctacttctgc 142140 

tgaatgttag caatagcaac ccacagtctg aaatcaatgc aatgaacttc tactatgggt 142200 

accatactga tgacaggaat agtgcaagtc cttacactgg aaggctgact ccttagtcac 142260 
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ataggtaaaa tttagaaatt gcagctctga taagagatca gtatgggaaa gggaaaataa 142320 

tggggtgcca gatgagtgca ccttcctgaa aggaaggcag atatatggga attaaaggtg 142380 

gacaagggat gctgtggagg taccatcaac tttcacaggg ctgtatgtaa aagcagctct 142440 

ctttcctgtt gattctccgc tgcctcattt cttctgggca aagtttgtta ctctccagta 142500 

acgtcccttc ctcaaactgt tacctaatcc caccctcatt gccttctctg ttttgctctg 142560 

tccttcagca gtctctacct gcttcttaag gtagtgaagt aagagggcag ttctggagtc 142620 

aagctctgtt tctatgaggg taaaggccag ggagagaaag gtttgggagt gtgaggagag 142680 

cctttttcct gtgttgttca agtacttagt ccaagctgct ttcagctgca tctgcagaag 142740 

atggggaatg gagggtgatc aatgccattc ctccagccac agagcaaggg ctttgcctct 142800 

ccttgcatac agtatactag ctttccttag tcaaatgttt cctctgtgct gcagagtcca 142860 

aggtaaagag gctttgtcta cagctaggtc tatgttccta gagaaacaat tagcaactgc 142920 

aaaatcaaga ggtactaaga aagcctctga agctataccc aggggtctgg caaatgaagg 142980 

gggacagatc aagaagaaag aagagtctag agcagtttaa gggaataatg ccactagttt 143040 

taagccacac atctggtggt aagcttttaa ctttgaaaga gacagaaatc tcaagataca 143100 

ccagcccaaa atataatgga gccataaagg tctgcacgta gctgaatccc aactggaaag 143160 

aacagcttca aagagcttgg aagtgctgag gtgaagaaga gcatgtgatc attagatttc 143220 

aaaagaaggt cctcagcaca ataaccagaa agttcacctt tctgtgggac aaaagatgcg 143280 

tccctcacaa aggctggggg aacaaaatct ttgcatctca ttttgcctga gaggagaagg 143340 

aaatacaaga tcatcttgtt ttacttggtg tgtatcacat cattaatttc tatttggtca 143400 

ctactatgca gaacttgcta acttgaacca tgtaaaaagc acactaggtc tcaagagact 143460 

aaaatgcttc ttgcaacagg cagagtgtga gagatggaag gatggaaaaa tcttgcagtg 143 520 

atgaaggcac tgataagaga tgttgaaatg atactaacaa atggcactct atctttccca 143580 

agatctttgt cagcatgaag ggaaaattct attccaagct ctctttgagg ggttaccatg 143640 

ttccaggata aagacttgct gcatacacaa gcgcacttag tcaggtcact cagatcagtc 143700 

tcatgctaaa aagtgtgaaa atagaaatac aaataagggg ccaagcagat tactgaacag 143760 

caaagattgc cagtacgtgt ccacaatgag tatttggaca tttcactgcc gaaacttctg 143820 

aaaatatcaa ctgccttatg aaactctggt tattccaccg cacaggagta tttgtggttg 143880 

agctgcatga agaaatagca agtgtttaaa ctgatttctt aaaagagagc ctttcctcta 143940 

catgctgctc ttgcacatcc atgcgtggct cctcttcagg agcaggaatt ggttttctga 144000 

ttcagcagtt gtgtagctga cgtagttata ccctttgaga gatttcttca gaaaaatgac 144060 

atgtttaggc taaagtgcat gtaatccaca catacaccat tactcacaat gaagtactat 144120 

gcagcatgaa attcaggcta ttcttcttca tatttttggt tttaattgct accttggtta 144180 

cttaaaaaat gctcaccatc tgattcatgc aaaggaaaac tgcacactgg tagatgtgag 144240 

aacagcacgc atactcactt ccagataaac taatctctac tcagatatcg agatcattgc 144300 
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ttctccagaa gtgttgcact ggtcatcaga actgagtatc tcaggaaaag cactgtcttt 144360 

tctaattacg gcatctaagc taaagcacac agcggtaata gtgcagtatg acaaattatg 144420 

ccagtgttca attcatgtgc caaatctcac cacgcctttg cgttctgcag gtgtggagca 144480 

aaatgcctca gtgatattta gacaggaaca ccaccacacc tcttaacaac tcataaattc 144540 

taaatgctat tggagtatgt cagcaaagat tgcttggcaa aggttgcaaa tgtacatgta 144600 

atatgtacgc tttagatagc tatctacact gtttcaaaat aaagacgcgt gtgttctcac 144660 

tcaaagcttt aaagggaaat aagatactca aagaaataat ctcttttgaa ctttaaaagc 144720 

tatttgagac ttcacgatga tacaaactta tcccacataa aaatcttagg acataaaatc 144780 

cattacaacc attccagctg agacatatac accattgtta cgctttaatt tacaaggtca 144840 

ggacaagctc ttgctgcatt ctgtgacaaa agggctcctt tgcacaccaa aatccatgca 144900 

cccactccaa gcacctgatc actgatcacc attaccatca cttcagtctc cgtgctccca 144960 

ttccccatac tgttttggct cttgccaatt acaggattgt tatgaaacta aatgttaagc 145020 

tgccctccca caggattcca acattctcag gtttcaaaac cattgtcttc cccacccctc 145080 

ttatctcctg aagtccttat aatggtttgg acatttaaag tcctttcatg tttaaaactt 145140 

actggcctgc tctggctgag acaaaaacac gagcagaatg ctctgttggc tgaaccagaa 145200 

accattcccc cccagataaa taaacagcac ttttactggt aaaaaaagat attagaagat 145260 

gccaaagaaa tggagtagct tttcttcaag cataattttt ttcttttcaa ataccaaaca 145320 

ccttaggttt gaattacatt agattttcaa gaattacaaa gggttcgtag ttaaaacagc 145380 

atacgtacat gaaaaccagc catggcaagt ttcacacaaa tactgtgtga aagcagaagc 145440 

taccaaacct tcctctcaaa accctcaagt acatttagat cactttataa atgatctatg 145500 

tagacagcaa gtatttaacc tactcctgat cccaggtacc aatgaactga gcaacatact 145560 

gtgtaggaaa gttgcactga cttgtgctaa gttgcacgga aactgaagga aaacaaaatg 145620 

tgcttatata gctgagatct ggccagggtg cctggtgtgc tgccaatatt tgtcctgcca 145680 

aaatggaaac atgaatgacc acagtgaatg aactacaggc ttacttccca caggaaggat 145740 

actaccaata caaacataag actttgagca tgttggagtg ttgacttagt agagagtggg 145800 

agtgagggaa ccgctgctcc tgagtcagcc tcagcaccgc ccattgaact ctgtacctcc 145860 

tagccttggt aacttcacag gatgctggaa aatattatca agtcatcaca ttgatttatt 145920 

gaatatcttc cttttagatt ttagttgctt tgtatgtatt tttttttcca ctaataacca 145980 

gccatgctat tcaaaaggca tttttaaaag gcaacgttaa caccctgtac aaacaccatc 146040 

ctctcattca ttcaaatccc acatttctgc atatatggaa catgtcagtc acttcttgta 146100 

acagagcaag tactatgact agtcagcaaa ttaaattcat ccctgcttta aaaacagaaa 146160 

atccaagtaa ctgctccaag ggatgaagtt tatttagtgt atctatcatt tgttctacaa 146220 

cacagttaat tttgcaaaga tgactcaaat catttaaagc tttggaaatc atttaaggcc 146280 

aatgtaaaca gattacaact ttcccaggcg caatggaagc aattaattct gcagcacacc 146340 
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tctcctacac tactatactc tggaaaacgt aacagatgca tctaattata acccacactg 146400 

aaacatgctg tctttatgta gctatgaatt caaaacagct gaggggcagg aaagaaccat 146460 

cctcctaaag ctatgtggct gctcacctgt aggaaagcaa cttcagcaaa gctttgagtt 146520 

cccaggttac actgatgcag aagcttcacc actggcaagg tgctccttgt gtgagcaacc 146580 

tattctgctc tataaaacat ttagcagata atcccatatc tcagtcctca gtacaagaga 146640 

ctctgtgcca gccacttctg tacgaaataa gcccacacgt actttcatag acctcagggc 146700 

agaagaaaag tttcagaaag cagtttgtgc tgaggagata gaccttgggg gtgagtcttt 146760 

ctccatattg aggcggaatc cctcaaagac aagcagccct tatccaggtg ttcaaggtga 146820 

tattttcaac agagcaggga ggtaaagaat gaaataaagg gcagagttac ataggatttt 146880 

tcagtcagag gtgagagctg agatggacag gacaatgagg taaggacagt gtgactgtga 146940 

ggagaattag cgatggaaat gcttcactag ccaaggcaag aagaaaaaga gtattcaata 147000 

gaatatcaat ttctggggaa agaattcatc tctgaagggc tacatagggc aaatagctgc 147060 

tttcaactta gaacagggaa actgaggcag cagcaaaaaa aaaaaaaaaa gtctaatctg 147120 

aaacccacat caggttctac tgttgttgca gtgataagaa aagtgtctga tgagtgtttc 147180 

tcaaccttcg ttatctcaca gtgaaaacat tttcctggtt atacagtttt agaatcctcc 147240 

aatattacca aaaaatcatt ttactaaaaa tggaatccca caagaaatga ctaatttttt 147300 

atctgtagga aacggacaat agaaaaactc ataaatatga tgtcactgtc ctttcgctgt 147360 

ctcttccttg gaaattgttt ctattagagg aatcataagt aggtcagcta ctgcattttt 147420 

ttaccctcca aattgcaaaa gaaatgttgt ttccagcagt gatggttcaa gttgtaacta 147480 

gcctgttgcc acaaaaatgt ttatagaaat atttctgcag tcagttttgt aaggttcttg 147540 

tatggtatca ctctcaccgt cacttcacat cctactctga gatgattcag ttcttccata 147600 

gggatgtggc cttcagggca aaataaattg cagagtcatg agtcataaga acctttgaag 147660 

aacaggccag gcagactata aattcagcct acccatcatc tgaactgcat aaaccttgga 147720 

cagacccaga gcagcagtct tcctccctgc acaaaacaag gtactgtaat tatttctaga 147780 

gattatttat ttcacctact cttggatgat gtcgatctgt tgacaaatgc atagaaaaaa 147840 

aaggcagaag gaatctgaat agaaaacaat aaatacttga ggaagaaatt tactaagatg 147900 

gcacaggcaa ggtctagaag gggtaagtct cagaaatatg caggaagagt ggtttattct 147960 

atgtagttct cactggcaaa catgtatatc atagcagagt aagaaccatt gtgtttgctt 148020 

aagttagatc atttgttcat gtgctcttca attcttgtgt acgtcaagac acagtcagta 148080 

catctttatt ttatggctat tctgtatcaa ccagaatagc tcccactaca tacctagggc 148140 

tctcagcttc aactgcaatg caaataacaa agagcagcac ctgtgttcta ccaataagga 148200 

aatttgtctt gcagaactgg gaagctatga ttcggactag caccataaga cagagtttcc 148260 

agaaattttt gaaagttaaa aatggaattc aggatactta gcacacagta gttagagagc 148320 

tgcttgtcta gtgcttcaat attttcttgt atcctaaagg aatggaatat ttgttcacta 148380 
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cagttacgag ctccaaaagg ctcttgcacc agtagtcata agaaacaatc tctcagcatc 148440 

ttctagccct cgcaccagtg agtagcatta tcatatgtca ctccaagatg ctgtaaagga 148500 

cagtgcaata aacgtcctga tgaacaaata caaaatgaaa tatgaggctg ctttttctat 148560 

atcacttgag tatggttagt gttgttggtg aaacgggatg cactatactt aatataactt 148620 

tttagtagct aatcttcctc attttccata aagatatctc tcttctttcc atctgtaagt 148680 

tccttaccat atcattatat tgcttacata gaattcacca tatagttata aattgcatgc 148740 

tttttctttt tagtataaca ctgtaaccat cctgtttcga tgaattttct tttgtttctc 148800 

ctggctgtcc cttgcatgga tatctgtctt tctaagtgaa cttctgctga accaaggaga 148860 

gttcattcat gcactacaaa cagacgtagt tggcagaaat gaatattgta ccagtactga 148920 

caggtcagaa tgctttcatt cagtttcctg aggtcagagg agagctgaag aaaatacctg 148980 

gcacagtcta gattttgcct ttgaacatac actgccagtg agcctcagtc atacaaagga 149040 

tcactgtgct gcactggcat tcttccacag tcaataagtg ttaaaaatac ttttagaaag 149100 

cctacaaatc ataaaataaa cttcaaatac tgaaagggag cctgcagaca tatagcaaac 149160 

acaattaatt cttagctaat aacatctttt gtccctttct gcgcaggttc agaacaatgg 149220 

aagctttaaa taaagcaaac acaagctttg ctcttgactt tttcaaacat gagtgtcagg 149280 

aagatgacaa caagaacatt ttgttctccc ctctcagtat ttcatctgcc ctggctactg 149340 

tgtatctggg agccaaaggc aacactgcag atcagatggc aaaggtgagt ctgagaagag 149400 

ttgatctact ggagtaacat tctctatgat agaaatttag catgatgcat caaaggaaaa 149460 

ccttatgcag gtcaaaagat acagtctaca gtagcttctg taagcaggcc cacaccaaat 149520 

gggagcagtg gcattagtga cattgtctcc ttttaaatgt cattggaaga aagaagagct 149580 

cttaatccca aagctcaata acttgagcac tcaccagtga gagggagact cggattctcc 149640 

acctgccttg ctccagaaaa ttctcatttt ctgtctcatc tctctggaaa tggccctatg 149700 

cagaaagacc ctccactgta tctctagtaa gttttgtgct ttgctaacat aaatctacaa 149760 

acccacaagg tcagagaaga aacacagtca gggtgataat acaacccctt cttttgagta 149820 

cgtactttac aagaaaaatc acctactgaa gttcctaaac tctgtgcaaa gttcatagct 149880 

tcaaaaggca gcaatgagaa cagccccagt gcaaacatgg ttacctagca tgtactgcgg 149940 

gaggggctga tgaactggca tctgctaagg cagaagaagc tgctccacct gctaaggcag 150000 

caatccaact actgagctac aggacaaatg aggaccagca gggtcacaaa gaaaggagag 150060 

atcttctgtc aggaagaagg gaaaacaaaa caaaacagaa ggcttttgaa aaatgttcca 150120 

aggttagatg tacacctctg tagcctgggt aaggtgcaca tgcccagagg aaggcattta 150180 

gggtatcaat ttgctcccag tgtttacctg ctttctgaca tgtaccaggc tctccatttt 150240 

acacccatgc tttggcagtt tccacctgca gataactggc ccgtcccagg tattacccta 150300 

tgagtacaag agccgatttg aagcaggcaa gttcctctag aaataccaga tatatgagaa 150360 

ttctgcttgc agccctcatc ttagtgtgct caagacatcc tgtacacatg ggctcaaaag 150420 
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taaaatctgt ctttgtctct cttcatcacc agtcccatga cattactgaa agtttttact 150480 

gaaacagcaa attttctatc attgcattta ttactgcaat ttccactgca ggtactctac 150540 

ttcaacgaag ctgaaggagc cagaaacgtc accacaacca taagaatgca agtctattcc 150600 

agaacagatg agcgcctatc aaatcaccgt gcctgtttcc agaaggtata taaccaagtc 150660 

taatgatcat agaatcatag aatggcctgg gttgaaaagg accacaatga tcatctagtt 150720 

tcaatccccc tgctatgtgc agggtcacca accactagac caggctgtcc agagccacat 150780 

ccagcctggc cttgaatgcc tccagggatg gggcatccac aacctctttg ggcaacctgt 150840 

tccagtgcct catgatcaag ctacagtcat gctaacacct tcccttgctt ttattttctc 150900 

tctctgtttg ccttcctcaa atgcagggta cacaactgat tagtacagca tcctgtgata 150960 

ccttcacctt atgcaatact taagacatgc ttcccatttg taggtagaat tgcaaaattt 151020 

aacctcaaat ttgcaaaatt tgaaatttaa tgatgggact attctctatc agtggacctc 151080 

ttgatcttct cccttcagct ttgagatttc tctctttttt ttttctcttt ctttttttcc 151140 

cctttcctct tctttctttc ttctttttct ttttctttct ttctctcttt caatctttca 151200 

ttccttcatt cttttctttt taactactac tgttcaatta gttattgcta gttattcagg 151260 

aacatgttct ttgaacaggc acagtcccta tctcagaaca aatcagaaca acaaaactac 151320 

tctcccacac attggattgc agcatcaaca caacaacaac aacaacagaa aaaaacagcc 151380 

agagaagtaa cttttacaga ctatccatgc ttgatacatt tcagaaatta gtttctattt 151440 

catttgaaag tttagtggaa taaattggca tgttggaatt tcctaaggta gaccttgcaa 151500 

taaatcttaa agaaatggga attattgtat tcccgagata tttctttgac agactggcag 151560 

gcatcttttg ttaaaataga cataatttaa gagagcagaa aatttggaag tcaaactctg 151620 

agtgagtaag gtagtttttc ctcactgaca atgcagccac tgtggtaaaa agttcctctc 151680 

cctactcttt cccatcattc ttttttcttt ttgtgagtaa atcatttccc tgaagtctgt 151740 

ccacaaaacc cctgtggcag caagttttga taaatgggaa cttgggtcta cattccacag 151800 

ctacggtggg aagactaatt ttggggacta cgccaacaaa ccatttatgt tgcacgaaca 151860 

ggagatggat tgtttctcat gagtaatgct tgtctgaact gtaagaatta tggagcgctc 151920 

taggcaggga aaagaaactg ttctaatagc ttagaaattt agatagctgt tcatgcttct 151980 

gattttcttg cagtaacaag atgaatacaa cacaggtcca gtttcttagt ccactaattc 152040 

acagcttcat ttccttaagc tggtttgaca gtttgagtcc acattcatat aattctgtta 152100 

cataaatata aagaatttac tgcaattact acaacaaaaa gcatttgcaa aatattatta 152160 

tttagaggta ggttaaaaaa gttagaggca aacttaccat gtaattaact ttcataaatc 152220 

ttatcaggag tcacacagcc aggtcttcat gtatagttta gcaattacat tctgtctctc 152280 

tctctgtatg tacttcattt tgcaacctcc atttaaaagt ccttaaacat tctaaacagt 152340 

tcaagctttt actacttgca tcccagggct cttacagtgt ctatagcata tctgaaactt 152400 

ttagtaattt cacatcattc ttttaatatc tgtctgagtt agtacacatc ttgcattgca 152460 
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gtaaaggcaa caccacctga atagcagtag tttacataga gctgcatgag gaaagaattt 152520 

agaaattttg aactgtttta cagaaaaaaa aaaaatgtat aacccttatt tccttgtctc 152580 

caagacagaa ataggcaaat caggtaatat ccatgctggg tttaaagcac tcaacttgga 152640 

aatcaaccaa cccactaaaa gttacttgct tagaagcgtc aaccagttat atggagaaaa 152700 

gtcactgcct ttcagtaagg taggtaggcc atttattcat gttatcctgt gtgtgtcaga 152760 

ctttatgatc tatctatgac aacaaaccat aaattatatg ctttcaaata ttttcattac 152820 

atctgcaaat tgtgtaatta tctttaacat acttcctgtg aggttcttct tgagaattta 152880 

gatatcatga cttttatagg atgtatattt aatttgtgtg attcacagtt gtggctacgc 152940 

aaaaacattt aaattatgta tttccaaata aaatcaatac tatgttcttt tgacaatgct 153000 

gtgcttgtag cctacacaat ttttatgcat tctctccaat cggctatagt tatttattgg 153060 

cattcacact ggcaggcaac aaacataaga cagatgtcta tcttgcactg caggaatact 153120 

tacagttaac caagaaatac tacagtgcag aaccacaatc agttgacttt gtgggagcag 153180 

caaatgcaat cagaagagag atcaattcca cggttgaaca ccagactgaa ggtaagctct 153240 

agcatctcct ctcccagttc tgaaggaagc agttttagtc ttgaacaatt tctctgtgcc 153300 

caaaggcagg taaacaattt aactcagaaa ggaaaatcag aacagttttg ctgaagtaat 153360 

catctgctgg caagcccttt ctagaattat ctttcaccat ttgaaaggga gaggaatgtg 153420 

gtttcctcta taaatcaagg ttgtcatgta tttatgaata atctcaagct agaagtatgc 153480 

caaatcagca ctctaaattt ccttgtctta tgacttcaga aactacgcca gcatttactc 153 540 

tgaaacagta aagctgcaca aatatgtaaa cgttctttgt ttttctctag gtaaaataaa 153600 

aagtctgctg cctcctggat ccatagattc actcaccagg ctagtcctgg taaatgcgct 153660 

ctatttcaaa ggaaactggg caacaaagtt tgatgctgaa gataccaggc aaaggccttt 153720 

cagaataaat acggtatggt aacatactgc cttatatacc agactgcagg ttgaaaaagc 153780 

agtgaaaaag atggaggaga taaattcctg tcattcttta aagccacata gcactaaaat 153840 

tagtatattt aaaacatacg ttatatcctt cttagcacat cttcagtaca aagaccgcat 153900 

acatatgcta gcacccaagg cacaaataaa attatcagaa gccagcttga aacaaacttc 153960 

catacacctc ttaaagcagg aaaaacatag atgtgaatag aactgtatga actagttcta 154020 

tatattttca tttttaacca tacaatgaat tggagtggaa cagagcttcc agtaaatacg 154080 

tgtcatccta gctggctaag ataaccttcc cagcctccca gtgcattccc agaagagagg 154140 

ggccctctgt agatcctaca gcttctctta gagccacagg gatgtacctc catgctactt 154200 

caatgtagtc tttactgttc tgagtataaa tagcaagctt ttcatttgat ttgttgcagc 154260 

atacaactaa accagtgcca ataatgcacc tgagtgataa atttaattgg acctacatag 154320 

aatcagccca gattgatgtt cttgagcttc catatgtcaa taatgaactc agtatgttca 154380 

tcctgctacc acgggatatc actggcctac aaaaggtaaa gggtaacttt aaactcaaat 154440 

tgcgtgagaa acaacgtttt catgcatatc catggcaaag caatcctgtt tctaggaagg 154500 
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aaggtatcga taaggctaaa ggaaaaacaa accccaaact tgcccaaatg ttatgaagct 154560 

gaaccttttc aatgttttgt ttggttttct ttttaactcc tggcacgtgg cacctcgtgc 154620 

ttcctcatgt tgatcagtgc tggaaataag tagcccgaat ccaacaagat agatctaatt 154680 

ccagctgaag aacaacgagg acagaaagat agttctgctg actgtctgta ctgattcgga 154740 

cagatattat tacattaaaa agaaaagcac aaactggaca ccctccacta ctttctgtga 154800 

tgtttagagc taatatacat gtacactgcc accttctgta aacacactga acctgacttc 154860 

agatagtgaa ctactgtgaa attctcattt acattagtgg gtgttttgta gaaaaaaaaa 154920 

aaaaaagtta ttttcactaa attctaagac acacagaaaa cagaaatgtg agcagcaagt 154980 

caaatagact attgttactt gacagtgacg ttgttttaca aatatttaat ccctctatat 155040 

tccctgatga ttactaagaa cagttcaaat actgcactaa catgctgtag agcaaaacac 155100 

tccttcctag tagaaaatat ttcagagttg gcatttcact aatggtttct gtacttgaaa 155160 

agtacaattt tttttgctac aaaaaaaagc tacagaattt ttgtagtttg aaaagttctt 15 5220 

aaataagaat ataaaagaaa taacccctag ggaacagttt tttgaacact ctgtaatttt 15 5280 

ctggttctct tttcaattaa ctgcagctaa taaatgaatt gactttcgaa aaattgtctg 155340 

catggaccag tccagaatta atggagaaaa tgaaaatgga agtgtatctg cccaggttca 155400 

cagtagaaga gaaatacgac ctgaaatcta ctttgagcaa gatgggaata gaagacgctt 155460 

tcactgaagg tcaagctgat ttcaggggaa tgtcagagaa cgctgacctg tttttgtcac 155520 

aggtttttca caagtgttat gtggaagtca atgaagaagg cacagaggca gcggctgcca 155580 

gttcagcatc tctagcgtca cgaacccttg gtgctacagt tatttttgta gcagatcacc 155640 

cttttctctt cattatcaga cacaacaaga ccaagtgcat ccttttcttg ggaaggttct 155700 

gctcccccta gaaaatcagc tattaataaa caagccctta caacaacgat gaacacaatg 15 5760 

tatgccatga agaacacctt gacagacttt gcactttacc attttcctgt actattgaca 155820 

atctctttta gaagagagct caaattaaaa acatgaattc aaacctctga ttccttttcc 155880 

tctgcaaaga atcctagcat cgtatactgc actgtagaac actgaactgc acgctgaaca 155940 

acatggatgt gtcttttcag tgctgtccaa accagaactg ctacaatgca gaacagacta 156000 

ggctgatcta aacagtacct tctgacccag ttcctttcac acgtaagaag aaaagaaaca 156060 

ggagaaactc attcctgcat acagctgttt catctcttca aagccagctg tcccaggcca 156120 

gctcaatcac agccttgtca gttttaaatc agcttcacaa catagcatgg ctggtaatga 156180 

aacaaaagtg caaaatcctc tgtgttgctg atactggtgg tttgctcttg cacacaaagg 156240 

agctaacaca tgtactttct aatctctgtc cctcataaac tagcaaatac caaacaatac 156300 

agaaccagag taaagtaaaa tacatacctt gaaatgcttt cttttgtcat aacctttaat 156360 

tcattcaacg ctgttgcagc ccagcactgc actgctttac ttgcctttta ctttgccaca 156420 

tattttgctg cttggagcaa gtgggagaat aaagtctgtt atgttaactc cctaagtgct 156480 

gtctaaaaga ttacatgcaa attctcctct acatattcac tgctttcaca gcttttactc 156540 
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ctaaagggga ggaattccta atcagtcatg cacatctaag aacacaggtg atgctcctgt 156600 

ttctctgaat tcagaacagg gaggaaagga ctgggtctct taacagcact tgcacacaca 156660 

ctgacagcat ctcactagaa acatcccttc ccagaaaggt aggatacctt tttcctggca 156720 

gagggaagag cgctgactga tagtgagtcc tttctgtatt attccacgtg accaactgtg 156780 

gccaggctcc cttttggctc tgcttcccaa atgggaagga acgtagggaa gggccaatgg 156840 

caaccaaatt agagggtgag tcttgcattg aggaacacca ttttcccacc gtaagtagca 156900 

cagcactggg gcagactgcc cagaaaaaaa ttgaggattt cccattcttc aaagggctgt 156960 

agctgaccta atttaccagt gggtctgctc caggcatgag ctggactatg gaaataccca 157020 

taaaccagct tgtgtcttgt ttctatcaac atccattcta ccttaccacc tcaatttctc 157080 

atcctctctg gcacccttac agcttgacaa gcaggggcag ttagcttgct tgcttgctct 157140 

caagcatatt tctttgagac ttggaatctc cctagcttgt actttcccat caatcaatca 157200 

ctcagtggac ctttcttcta ccctgatttt ttgacaactc ctctcattgt actaaagtca 157260 

ccacttgttt agtttctggc atttcttcat tgccttgcat aatactgatt tctataactt 157320 

ttcagaagac ttgacccgct ctgttcttgt aatcaagctt acagcaaaat gcttggcaca 157380 

ccatacagtt tccttccctt cctcctcctt ccttccctgt cccctagact cttacagata 157440 

ttgccactac cttccccttc tcttaccacc aaagagctgc tctgctgtct tctgggacag 157500 

cagagtgaca actattgtga catatttatc cctgttgtat tgttttcact ccctccttgc 157560 

tcaagctctg ccagtggggc cgtttcagtt cttgcccacc tccatggcat gcaagtactg 157620 

cacactaact cagtactctg cagtgcttct ctgaagcgtt tcccagccat ggatgcaatg 157680 

agattctttc accaacacaa gaagcaaggt attcaacctt acaccttcaa tggctttgcc 157740 

tctgcctatg ctcctaatga tcttcactct gaagcttgac tctcagagtc tcattcagtc 157800 

agtaagctcc tgacatttta tgctgatgca tcttactcat agaatcatag aatggcttgg 157860 

gttagaagat catctagtcc caagcctcct gccaaccact aaatcaggca ctacatcagg 157920 

ctgcccaggg ccccatccag cctggcctcg aacatctcca gatctccagg gatggggcat 157980 

ccacagcttc tctggggaga ggttccagca cttcaccacc ttctcagtga aaactttcct 158040 

caacatctaa tctaactcta ccttatttta gttttaaaac attccccctt aacctatcac 158100 

taccttccca tgtgaaaagt tgatttccct ccttacttac aagcttcttc taggtacttg 158160 

aaggctgcta tcaggtctcc cctgatcctt cacttcttca gactgagcaa gccaagcttc 158220 

ctcagccgat cttcagagaa gaggtcttcc agttctctaa ccatcttcat ggccctgctc 158280 

tggacttgtt ccaaaggtcc acatctttcc tgtgctgggt gcctcagacc tggacacagt 158340 

actccagatg gggcctcatt tgggcagaac agagggggga caatcacctc cctctccctg 158400 

ctggtcaccc ttcttttgat gcagcccggg atattgttgg cctttcaggc ttcaagagca 158460 

cgctgctggc tcatgttaag ttcttcatct gccaggatcc ctaagtcctt ctcagtaggg 158520 

ctactctcga tgaattctta tcctagtatg gatacatatc tgggattgtc tcaacacaag 158580 
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tgcaacacct agcacttggc cttgttgaac ctcattatgt tcacatggac ccaatcctxa 158640 

agcctgccca ggtcgctatt aaaaaaaaca agtgtcatgg cttcacaagg ctggaaagtt 158700 

ggatcaggcc tacaatacct gctaacatcc agaaaccaaa accatgcatt ctggctctgt 158760 

aatcatttta ctagatttat ttagtttaaa caacaggcca gttgtcttca caacaacaga 158820 

agatcactga aatgagtgag tgacttgttt tactgtcctc tcagtagaca attgtggtgt 158880 

aacagttaaa atgagattat gatccacatt gttcctttga aacgcctaaa actaaaacat 158940 

aacatgatga acaaggaaga caaaactgta tgagatcttt ttgtgattca ttagaagctt 159000 

tgagtaggcg ggaacagtgt tgacatagga gagaaggaaa aggaagtgca aactgtacta 159060 

tatttctaaa tttattcact gcataacaca caggcacaga acctgactga ggacaagact 159120 

caggtcttct ctctcacggg accacaggta aacatttaaa ccaatttaaa taactttttt 159180 

gatgttttta aatggttatc tatagcttgt atgacaatgt aagtatatta aaacacacca 159240 

gagttattct gtagtcggga gcataattga tcacaagaag gaaaatcttg tcaggacagt 159300 

agctgtctta ctaattaaaa tgttcagttt gataaaggag tctcatactt caggtaaagc 159360 

aaaggccatt ttcattctgc cttgtatgag gtcaggccag ggactcagag gagcagagta 159420 

aaacagacag atttctatcc tgatgctcat ttggtcaggt tctccaagga gaggaagtca 159480 

ctctgttggt acagatttgg tgtagactgg ataacgactg ccagaaaaac tgaagtggtt 159540 

tggtaaccaa aatcttgata aatatctgtg gacttcagag attgtcctgc aatttctgca 159600 

gtgccattca ataaatataa atctttcttt acataataat aactactaca acaacaacat 159660 

tttccagtcc ctctatcaga aaacaacatc aagaaggcac tactgaacag gtaagttaaa 159720 

gtttggaatg ctcatagctt atatgcatag gtatttgcca gtttctgggg aaaataaaat 159780 

tgcaagaata taaagaagag attgtagtta gacttcgtga ataaaatggt aacactctaa 159840 

aagcaaataa caactttgcc atacattata ttatctgaaa tgggtgacta gccagaaaaa 159900 

ttccataagc ctaagagtta cacctaaata cattctcagt atcagctcct aattctatct 159960 

agatccaaaa tgaggtagtg aaaagttcaa atgtcccatg tacaaaaaac tacttaaact 160020 

tccctaggaa cattactttg ataatgagtt aagaatgaaa atgaacaaaa tatgcagctt 160080 

acaaatccac acacttttga aaaccaaagg cagaaagaaa cacaaataaa agggcagatc 160140 

tataaaagag gacatatcta taatcataga gaaatatgag atggataaca aaaacctaaa 160200 

agaaactgct gctcccagca ggtggcacat ggtatgtgta gaacatataa cgtacaacta 160260 

ggctattagt ttcaaaaggt acctacgtgc tccgttgcaa atgtaacatg taaatgtaaa 160320 

atgtaaatgc aaatgtaact aatatgcact acatacatca ttttagacac tcaaatacta 160380 

caattctgtc tgttgcctct ttccaggctg tagcaatgga acaggtctcg gcatcaattg 160440 

gcaactttac agttgatctt ttcaacaagc tgaatgagac caacagggac aaaaacattt 160500 

tcttttcccc ttggagcata tcatctgctc tcgccctgac atatctggct gcaaaaggca 160560 

gtacagcaag agagatggca gaggtaagta gctctgtgaa gctatgatgc tcaacactgc 160620 
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ccagcactgc tgttgagatg ccctgctccg ttgtcatagg gaaaaactac atttgagttt 160680 

gcacaaatgc attgctattg ctgagtgcaa tggctgtgga agggatttca gccttgtagt 160740 

gcacagacag aagcactgtg atgatgctca caggcaggag caatactatt ccttgttact 160800 

gtaggggatt tacatatact agagctccag tgtccctctg attagatcag aagatagcac 160860 

agtgtgfrtat cataaggatc caaacaagac aaccatttta tctcttttag gttttaggtc 160920 

atgcaaactc tttcatgtca gtttcttacc tttggaaacc ctgtttgcag gttcttcatt 160980 

tcactgaagc tgtgcgagct gaaagctctt ctgtggccag accttctcgg gggagaccaa 161040 

aaagaagaag aatggtatct attaaacatg aaatcccaag ataagagttc aaatgtctgg 161100 

atatagtttt taagagtcca ccatttcttg tttgcagctc tctttatgtt taaagtataa 161160 

aacccaatat acttcgcatc acatccaatt tcagttccct tcactcattc agactcaaaa 161220 

gtatagaagc acaagtcact ggtataatct gaaaggattg caatatggta aatcagttaa 161280 

tcaaatcata aaaggagccc tgcaaactgc agtggtggtg aatttggaaa gataaaaagt 161340 

aagagagagg aacagaaatt cttcccccac atctacccct tagcgtttca aaaacttcat 161400 

gccaaaaatg caactggtaa atgtacagtt tctctttcca aggaccctga gcatgagcaa 161460 

gctgaaaaca tccactctgg attcaaagag ctcctgacag ccttcaacaa acccagaaac 161520 

aactactcgc tgagaagtgc caaccgtatc tatgtggaaa aaacctacgc attgctgcct 161580 

gtaagttgaa tggttttatg tcaaagaaga aaaagaaaaa aaaagaaaag aaaagaaaag 161640 

aaaaaaagaa aaaaaagttt tgcattgtat tcacttacca ttaatagaac agatctgaag 161700 

ctgtccataa atgctgcaaa tgatgagtct tggcttccag tgataaactt cattggaaaa 161760 

tacaatttgg tcttctccca gtatataaga atgcacttgg ctgtaatgca ggactccttt 161820 

tcatgtaata cagctttatc actaggaacc tcagtacata caattgaaaa tgagatatta 161880 

aaatacacat atccagggga tttgcacagt cttccttcct tctccaaata aaaatgggaa 161940 

cgagagaata agagtatttc ctttggttat ttcctaacca ttaactcact gctcaataga 162000 

gagcaaaatg ctagatcctg caattgcctg tgtgcaaaaa gttaacaaga agtccagtag 162060 

ctaacaaatt actttttgga ctataaaaat actgtacaat acagaatgtt tccttctttc 162120 

gttctttttg tatgccattt tccagacata tctacagctc agtaagaaat actataaggc 162180 

agagccacag aaggttaact ttaagacagc accggaacaa tccagaaagg aaatcaacac 162240 

ctgggtggaa aaacaaaccg agagtaagtt gagctcaact ccaacatcct tcctcttccc 162300 

actgttccct tcgggaccct gttcccactc ctgtgactgt ggcatccagg tcatgccctc 162360 

tggtgtgggc agtagatggc tgtctgcttc cagctgcttg ccttgagact gtggcgtttt 162420 

ttcaggcagg agccaattgc tgtcagctag ccaggagaac tgggcaacaa acagcaaaca 162480 

gactaactgg tttatgtcag ggaagtaatc cagggagtag ggcactgagg cttgcacttt 162540 

ttcactaagg agttgaactg agtggataaa gaatcaacac attccctcac tgtgttacac 162600 

tggagtaaag cctgactttt ctgatttcaa aaggtaaaat aaagaatttg ctgagttcgg 162660 
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atgatgtgaa agccaccact aggttgatct tggtcaatgc catttacttc aaggcagaat 162720 

gggaagtgaa atttcaggca gaaaaaacat ctatacaacc cttccgactg agcaaggtaa 162780 

gctcctctgg tgtcctcctt aaaacaagca gactggagac tgcacccact accatctttt 162840 

atttcatcca tcctttaggc attccttggt aaacagactc tctgaaaagt tgtttacagc 162900 

aaaacatgtc agttgtcagc tcaccaacat ttatggaaca ttaagatgct gctcaggcaa 162960 

aggataacta gatccagatg gaacacagtt tccaaaaatg ctagggtcaa ttaaagcctt 163020 

tttgcaagac tgaggtataa gagctacatt gtaaaaatca gatattaaga gtccatcctt 163080 

cctgcacagg aactacatgc tatgctatgg acgagtgcag tacccgcgcc tctgtgctgc 163140 

acaatccggc tgtgaataca gctgctaaag tatggatgca gcagcacagc tccactggat 163200 

gggtgcatgg ccgagtgaga ctagaagtaa tgttgccaga gaggagatca caaaaaggct 163260 

gcacaacatt tatcctctca caccatagct gtttcattgc tgtaatgttg ggtgcctgta 163320 

tgccatgaat gctccatccc cctaattctt gaagatattt ctgactccct tcctctctcc 163380 

tctgtgggtt gatgtgcatg ttctggggaa aagagaacat cagttagctc agtccccagc 163440 

aaaatactct gggaaaagag ccaagatcag caatattgtc cagtcaagaa aagccttgga 163500 

aaaagaatgt caaatctctg ttacaaaagc tgcttatgaa agtttcctct ttacaaggaa 163 560 

ttcctttttt caaggaataa ttttaaccga taaataaata ccttacagaa caagtccaag 163620 

cccgtgaaga tgatgtatat gagagataca tttccagttc ttatcatgga aaaaatgaac 163680 

ttcaaaatga ttgagcttcc atacgtgaaa cgtgaactca gtatgttcat cctacttcct 163740 

gacgacatca aagatggtac tacgggtctt gagcaggtaa aaagttctgc tacatccatt 163800 

ctgtatcgcc actcagtcat cagaacaaaa aggacaggct gatgaccata cggccctttc 163860 

tttctttggc agttcattcg gcagaagtag cgcacaaaaa cttgcagcat tatgtctcac 163920 

atttgctttg cagcctgttc tctggtcatc agtaaaagca atttatattt catattttca 163980 

gctgaatgtt aaatacgcca tttaaaaatc tgtttaaatc attaaaaaaa aaaagacaat 164040 

cataattaat tggtttatcc ttgcaattat caaattcctc tcatttctta aacaacagct 164100 

ggaaagagaa ctcacctacg agaggctgtc agaatgggct gattcaaaga tgatgacaga 164160 

aactcttgtg gatctgcacc tgcctaagtt ctcactggag gacagaattg acctccgtga 164220 

tactctgaga aacatgggaa tgacaactgc cttcacaacc aatgctgatt tcagagggat 164280 

gactgataag aaggatttgg ctatttccaa agtcattcac cagtcttttg ttgcagttga 164340 

tgagaaaggc actgaggcag ctgctgctac agctgtaatt atatcattca caacttcagt 164400 

tatcaatcat gttctgaaat ttaaggttga tcaccctttc cacttcttca tcagacataa 164460 

caaatccaaa acaatcctgt tttttggcag attctgctgc ccagtagaat aaattattcc 164520 

tcactcctag agggatccaa agttcacttt tcaaaggaaa aaatgtgaac tgtagtatta 164580 

aaagctcagc cttcaatcat atagccataa gtactggaag tctatgtctt tttccttaag 164640 

taaggcagca cccagacacc accacgcgcc tcgaagactg tctctctact gctcctttcc 164700 
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attatgctca tgaaattgcc ttttatagaa agcaaatgct tgaggtacaa ttgctagcct 164760 

ctgttcacct tgcgttttgt ccttatttct ctaaactctc aagactgagg ttgataagta 164820 

tcccaaccag caaaaaagac caagaaaact acaacaatgt gccttattgc tacctcttac 164880 

tgaaatgtga cctaaacaat tcaaatctgc ttcccgtttt cattaacata attatatgtt 164940 

tcctggctaa catctgcacg gtctccttgc tacctggatc attgataagt gtatgatttg 165000 

taacttacga gtgcctttca gctaagatag tcccggtatt gacagaaaca ccagtaacat 165060 

ttttatggat gcttcacttc attattttgc catgatctac atttaaacaa taaatgaatt 165120 

tggaactgtg tttatgctat gcaagattct gactcacgta gctcttttac agcatcctgt 165180 

ataatgggtg gctgacacat atttccattc ttgttatttc aaaccaacca tcacatcacc 165240 

gctaacgaca aagtgctgag gcactctaat aaaccagggt cttactccca ctagatttca 165300 

tacaacactg aaaacactgt cgttcaacgt gttatcgtag acatatacta gacacaccaa 165360 

ttcaaatcaa agcctgtgat aacagagtta aggcatttgc ccagtcttgt tcaacagctt 165420 

caccaatagt ctagataaag ggatagagtg aatcctcaac aagtttgctg acaatatgaa 165480 

gctgggagga gtggttgata caccagaagg ctgtgctgcc actcactgag acccagacag 165540 

gttgtagagt tgggtggata gaaacccaat gagcttctac aagaacaagc atagggtgct 165600 

gcagctgggg aggaataact gcatgcatca ctacagatca gaggctagcc tgctggaaaa 165660 

aaacctctga agagaataag ctgggtgtct tggtagacaa cagattggcc atgagccaac 165720 

agtgtgccca tgtggcccag gtggccaatg gtatcctggg gtgaattaat aagagcgtgg 165780 

ccagcaagtt gagagaggtg attctcccca tctactctgc actggtgagg ccacatctgg 165840 

agtactatgt ccagttctgg gctcctcagt taaaggaaga cagggaactg ctgaggagag 165900 

tccagtggag gactgccaag ttgattaggg acctggagca tctcccatac aaagaaaggt 165960 

tgagagaccg cagactattc agctcagaga agaaaagact gaggggtgga tctcatcatt 166020 

gtttataaat atctaaagta caggagtcaa atgaatggac ccagactgtt ttcagtggtg 166080 

tgcagcacaa gggggcaatg gctaaaaact gcagtgcaga acgttctata caaacatgag 166140 

gaagtacttc tttattttga gagtgacaga gcactggaac aggcagccca gagaggttgt 166200 

ggagtctcct tctctagaga tatttaagat ctgcctggat gctttcctgt gcaacctgcc 166260 

gtagggaact gctttagcag gggttggact ggatgatctc caagaggtcc cttccaactt 166320 

ctatgattct gtgatctgaa cttgctttca ctgtaagcat tcagtttccc attgttgacc 166380 

actgcttatt gcactatcag cgccccatgg acctactgag aaagctcatt ttcttctgca 166440 

ttatgataca caagaaataa aagtctgacc agagcaaata gcccagaaga gtggcttcca 166500 

aacatccaat gctgtatcag caacatccat ttcttccatt agcttaggag ggcagtagct 166560 

gtgctatctt atgcctataa ggaattggat aaagttttgt agccattatg atgcccacct 166620 

accacacgtc acaggacatt atttcaagca acaagatctg aatgctggag ggaagtactt 166680 

tagaaatata tactcctagc aaaattccac ctgtaactac agctcactct atgcctcaac 166740 
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aaaaggtaga ctatttggaa atatatctcc acccaacaaa gtatatatac gccgccttct 166800 

ctgtacactg tgctgcttct aaactcagtg ccctccctca cctgtaaccc ccagctcagg 166860 

gttcagctgt ctgtcactga ggacccactt tggttcagcc cagctgcatg ctactgactc 166920 

agctttgcaa agaattgcac attgtaaatt gtataagtgt ggaaaagcac acaagtgaaa 166980 

actacacaat ataagtaaag taaaaacaca aagaagttac tgtcaggatg gaatagtcag 167040 

catagcaagg actcgagagg attgcacgta ggggtgtcaa acttcaatta aaaaaaagtt 167100 

aagaattgga attctccagc ttgttcaggt ccttctgaat ggcagcaaaa ccctctggtg 167160 

tatcatccac ccctccaaat tttgtattgt ctgcaaattt ggtgaaggtg aactctatct 167220 

cagcatcctg gtcattaatg aatgtgtaaa tagtactgac actggtgtta accgctggag 167280 

tacaccacta gtgacttcac cttcagctgg acttcatacc actgatcaca gctctctggg 167340 

cctggctggg ctcacttatc taacccatcg tttgtcagcc tggccataag gatatcacag 167400 

gaaaaagtgt taaatgctgt ttctaaagtt aaggcaaaca acatttagta ttctcccctc 167460 

atcaaagaga ctagctgcct catggtagaa ggcaatcagg ctggctaagc atgacttccc 167520 

cttcatatat caatcctgac tactctcaat ctccttctta ctcttcatga gtttcacagt 167580 

agtgtccaac tctgtggtag cctgacctcg cagcaagcag aaatggcaga gagagtacag 167640 

aataataaag acaaggaatg ttacagtgct ggtgtatttt tagtactcaa ggcttggatg 167700 

cagaaaaaaa ccagcaacaa aacaacctat tttagaacta aaatgcaaat gtaccaatat 167760 

tttttaaggt ttttatttca ccatcagtaa ctggaacaag aaaatccctt ttaaaatatt 167820 

atttatactc cagtcactta caggtctatt gcaaccactg tgatacttat tctcctttcc 167880 

tctcttgtag gacaaggctg aatgggactt cccacggagt aaaaaactgt aactggactt 167940 

aatctagcag tgattttaaa ttgtgctgct tgttaaccag gcataaatgc ctagatgtac 168000 

caaactatag agtaaactga tcacaggtat aggagtacaa cagcagaata gaaagataca 168060 

gcctcacatc cacactgaga aaggtgagga tgaaagatga attcatgaga aagcccgaga 168120 

aaagtgactt gctttatcag aaaaatttta tgaccatttg tcagagagca acctatttgt 168180 

ggccagcagg ggttttgatg ctgcaacaca ttgtttctac cattctgttt gcagtaagcc 168240 

actctttcct cttctccacg tggcacagca aaaccaagag caccttttct ttcccatgtt 168300 

ttctcccctt tgaaactcta ctccctctta ctaaacagga tcttgcagca gataccatag 168360 

agtgtccaca cagtttgggc acccttttgt aattttcctg tatttttgcc caatctttgt 168420 

agtttgtagt acagtaatta ttgctattaa ctgctttctt acctctccta aatattcagc 168480 

ccaacaggta acagcagttt cagatatgta gcaggaacat ttcttaacca cataactaaa 168540 

cataatagca ttcgatagat ttctgaagaa agaaaatgag attgatattc caggtaggaa 168600 

aaatgtgcac acttagagag agagcatgtg aaaaaagata ggtatacatg tcactcctga 168660 

aaccttgtcc tgtgtatgta tcagtcacat tcaaagtgaa tagcaccata aaaatttagt 168720 

tctagttcta gatatggttt tgtactcctc atacaaatgc ttttgcacat aggttgactg 168780 
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catgctgata actggtttac tttccagtgt cataactgtg agcagagggc actcctaagg 168840 

caattctgag taaccaaaca tgtactagaa aatatataca atgattgcaa aggaaactga 168900 

tcaatgcatt tcatccaagc cacgcagata aaaactggaa ttgctaatta cattggttaa 168960 

aataaaatgc ctgtgtacag ataactgtct tgcattatat ctttactgat ctgattccat 169020 

ataacatgct acttcgactc tctactacaa aacaacaagc agtttcagta aatcaaataa 169080 

ttacaagcag aacttccaag cagcttggca gcttttcgtg agtggcccaa actccctccc 169140 

ttcatagaag ataataagct tggcacagtg cagggcctgt aggttatcta aagcatcttc 169200 

acctcctctg ctaggaggaa atgcattctc ttaaagcacc acccaccagt tcgaataagt 169260 

accatgctca cgtagacttc atcacttgct ttaggtgact gtgggaatac agtctgagtc 169320 

tccttgagga tctgactgct gagacataaa aggattaagg gtaaaatatc acaacataag 169380 

ggctttgtgt gacctgacaa ccacaagtat tcaaatcttt ccactctgaa atttgacagc 169440 

aggatgaatg acactaacag cctacttctg tactcctttt cagctaaatc tttgagatta 169500 

gtttctctta attccctagt tcatagagta aaaagaaaag gaaactgtaa aattatttca 169560 

acgtgtccct tgtcaatgca ctgcaactct attcttttct gcaattccta catacttgca 169620 

gacaaagatt gaggagggca ccactgggaa aaaggcatac caggcttcat gtttcaggac 169680 

acacaccaac taacagtggt tttgaattgg cctctctacc ctcatctctc taattctgaa 169740 

agccagtgac ggagagaaga aatgggaata aatgagggaa caggcatatt aatacaagaa 169800 

gaggatgttg ggaagagaag ggaggtgcaa ccatttctac atttccaaaa gtattcgttg 169860 

cctcacagac cgccacagat cctgactgaa gaaacatctt aactcttccc tctctcacag 169920 

gatttcaggt agtgtaaata ttcttatcta aataaccagc tttattttga cagtcttaaa 169980 

aattcatata aattttccag aatggagagg gtaaatggta acttaaatgt catgcaggga 170040 

aatggagcac tcaagtcttc atacaaactg cagcctttag gaaaacactg cacaaagggc 170100 

acagtgttca ggatgcacat ccagactggg caaaacctgt gcagcacata gagagcacaa 170160 

gagtttcctc acttgcaggc agggtgccct aatcaccagg ctaccagctg atttaggatc 170220 

agaggactga acacagaacg cagatgataa tgcactccta ctgatcctgg agttgaacac 170280 

acgtgactgc atgactccat aacacagtgt tttagacaat cagcactgtc caagccactg 170340 

ctttattcgt gttaaataac tactggatca gcacccagcc ttttcctttc ctgaatgacc 170400 

aaagtggcta ttagacaggc agtgctgata caaacacaca tgctctctca agccctagaa 170460 

atgcttgatt gctgtgtgca aattcagttt tgacacgctc tggcaatagt ggagatacta 170520 

gtttgaaatt ctttaagcaa aaacattcag ctgagcccag ttctctagtt atgctctggg 170580 

agtatctggc tgatcagctc ctccaggcac agcagtaaga aataccaggc ttaaggtaaa 170640 

atgcttaagg tatgagctag gctgcagggc attcacctct gaagcacacc tgagaatgat 170700 

ccaaagctgt tttggagcac tgggaagctt tgatatccaa atttaagaat cccaaaagct 170760 

ccagctatct ctaagactga acagcagctg gttttgagga ctgctgcttt tgatttcagc 170820 
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atgcaaattg tgagtgaagc tggactgaaa attccactca ttctcattga cctaaggcaa 170880 

gtctttcact tgatgcaatc ctgcaccgac aatggaaatt tctgcattta tcctaatatt 170940 

tgtcaacagt actaaattca tctaaaatat gttagttaga accacgaacc tattctggat 171000 

ccagctatga aaaattactt taaccctggg gtttctatgt ttttgttatt atctttgatt 171060 

ttgtcccata aagagggaag cagaaagaga atgggaaatc tgcaggttct gataaaccag 171120 

cacgaaagta gaacagagct gttgcttaaa atgggaactg tttggagttg tatgtaagac 171180 

acaagagctt acttgaaaaa cagtcattaa caaatccagc tacagttgac aaatgcaggc 171240 

ctacgctcac aaagatcaaa taacaatcta cagatctcca tcagccttct aaagcatgtg 171300 

gcaaatggca tgaattcgca atagttcttt cagcagaaaa taatgcagtt taaggaagta 171360 

ggaataaaag ctcagcgcag cacagtggga aatatttcta tttcgctccc tcagaatgga 171420 

aggataattc taagaaactt aagtaaatgc tttttaagaa caagtttggt ttgggaaaac 171480 

gttcctaaaa tagtgtttgt ctgatgatga ataagcgaaa cggctgaaac aagaatataa 171540 

gcatttaaca gtgatgggaa agggagaaat agcgtaggtc ttaaaaaggg acttgctatc 171600 

tgactgtacg tggcgttttg gccattacag agccatccca tcagcgtgtg tacaagggtg 171660 

caggcaatct cctaccagca acttcccatt ttgttagatg ttgtgaagat gcagtcatca 171720 

cctccctctg ctaatcattc gtctcaggct gtattgatgg agtgtctttc agcatcaacc 171780 

aacagcttca ccctggacct ttacaaaaag ctggatgtaa cttccaaagg acaaaacatt 171840 

ttctttgctc cttggagtat tgcaactgct ctcgctatgg tctatctggg tgcaaaaggt 171900 

gacacagcaa cccagatggc taaggtaagt tctgaactta gcagtgtatc acttaaagct 171960 

ggctcccacg tgacttgcat tattcacctg cctacgtgga acaaaaagga gcaacatgag 172020 

agaacagtga tccgccatca tgttgggtgc aagaaaaaac actgcatgca gtcctgcagg 172080 

acactgttgt tccaacttat ttccaagagc ccccttctct gctaccacct ctacctttag 172140 

cttaaaattg ttctggcaga gtgaagctat ggacattaga gactgtcttc tcattctcca 172200 

cacccagtta gtaagtcctc tgaacccaag cttggctgca cagcagcagc agcaacatct 172260 

ttgcttgtga catggttagt ccataaataa ggtaatacac ccatgaagaa gggaaggttt 172320 

agattggatg tcaggcggaa gtttttcaca gagaatggtg aggtgctgga acaggctgcc 172380 

cacagaggct gtggatgccc cttccctgga ggtgttcaag gccaggttgg atgtggccct 172440 

gggcagcccg gtctagtatt aaatgtggag gttggtggcc ctgcctatgg aaagggattg 172500 

gagcttcatg atccttgggg tcccttccaa cccaagccac tctacgattc tatgaataat 172560 

ggggatatac tagaaatgaa aatatgatat catttataac cacttttgca aaactttcgg 172620 

tgatgtactc ctagattata tgcatttaca taaatgcatt tatctgtgta atgtactgta 172680 

gagttgtaca ttggtgcctc aatagtaaga ctacaaacca tcctatgttg tttgttctgc 172740 

ctgataacaa tctgaaaata aattccacat tgctaagcat gaattgacca tttctccaaa 172800 

tcaatccatg tctgagcaat cacattgatc tgttattaag tagtaaatga ctaaaattaa 172860 
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tataactatg atacggttat agaatctaaa tctagaccga ggtcttgttc tctataactt 172920 

taatagacta acatttgtac gatggctaaa ttatcctaag tagaaaacta acatcattac 172980 

gtaacactag agcacttcta tcttcacaaa acaaactgtc cttaagaaaa tttatcactg 173040 

ccacaggttc ttcattttaa ccagactgca agagaagaaa gtccttctga gatgacagca 173100 

ccttctctgc ggagcccaaa gagaagagac atggtattta ttttgacaag gctcacagaa 173160 

acaaatctcg gagaatggga cagcacaact gccttaaccc tctgaaattt gtctgtctat 173220 

acctactgtc ctattgcagt aagaactata cataaaaaat gtgaacaagc aggtaatatt 173280 

actgtaaaac tcacgaactc agaacttcaa agcagaacag agacaccaga agcattctga 173340 

ttgtctttta cactttgtta cttgctttta gctgctctat ggcaatagga atacctaata 173400 

ttttactaac ttcaggattt tttttttttt ctgggctgta aaacagccat tgccttcaca 173460 

tctgactcca gtcctgctct cttaaactct gttccacctt tgctatgcat gcacacatca 173520 

ggcaaacaaa attcccctgt aagctaacac acactgattc acccttgctc atcagctccc 173580 

cagagacttg acagcaggag gaactgcaca gaactcctta cgcctcaaag gtctgtcaac 173640 

tcaaacaagc tgcaattctt tagcacagaa actaagaaag ctcaaagaaa ccaagttaga 173700 

aaaactggaa ataaaaagga gcagttacat cattctccat ctagatggcc gtggattaat 173760 

atttaagata aatcattatg tccacattga ttttagctat tcactgcctt tctaagttgc 173820 

acagcaaggt ctgcctgatg tagccttcca gacatttctc tccattacct cttcactatg 173880 

tacccattgc cttgcaaaac ctatctacta tcctgttctc taccaagttc ttccccacat 173940 

gtcctttctg aaagcctgta cctctgcctg tgtgaaaaaa taccagagga aggaatgcct 174000 

cctcaccaat cttatgacaa gcccccatgc atcagcagca aagaatctcg gtgtctcaca 174060 

tgtagcgcat ggtacatgcc atggagcagg aaattatact gaagcagctt atccctacac 174120 

tacgaaagca acagctgaca agcaagctcc tgctccctaa aaccatcacc aaggacattc 174180 

tggacaggtt ccttccaaca tccaagcagt aacagcaaaa ttcacatcaa atagaaaatt 174240 

cgaatcaact caattacatg acatcagtat ctgtctgaac agagtagctc ctcaaaagct 174300 

gcaatgttgc cttaatgatt tttgtgataa tcaaaattag gcttgactgt gactggaatg 174360 

agatgaccca atatcctggg tgcaccatct gaggacagct gatgtcctca aaggtgtaag 174420 

cagctccgca gaatcaaatc ataaccccac agacataaaa caatgtatcc agttacacac 174480 

agtgctgcaa aagtaccaga tacccgaaga agagctccat ccctgcacac actttttaaa 174540 

ttaaaatgac ctggggattt taaataacca tagaaagtgt aatgcttcag ccaaaaatat 174600 

ttcaatacta attggtgcct ctttccaagg gtcttgagta tgaggaaact gaaaacatcc 174660 

actccggctt caaagaactc ctgtctgcca tcagcaaacc tagaaacact tacttgctga 174720 

aaagtgctaa ccaactgttt gaggacaaaa cctacccatt actgcctgtg agttgaacgt 174780 

ttctgcttaa gaatgttttg acaaagagta tttgacgtac atatttgaca atcataattt 174840 

tccaaatgtt ctgcctttta aatctactgc agcctttaaa actgtaagag ttctacagtt 174900 
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gaactacaga aagctcctgt ctcttttgag cctatttctt gttagcatca ctgttggttc 174960 

acttctcctc taaaggaagc ttctccatat cagtggtcat cttcctcccc cttctcacta 175020 

tctattacat tctagcattt ctgagattgg aggccagagc tatatagaaa agtcacacca 175080 

tacctttaca gagcggcata ttaacgttct cccttttatt acactgtcat ttccttcaac 175140 

ttttcacatc atatttgctt gtttgactat tactgggcag attatttttt tttttcatgc 175200 

aactgtccat tagaacttca atacctcttt tctaaagcaa taatcatcct tattagtatg 175260 

ctttgctgga tcattttcag atgtttcaat attgatactg taatctattt ccttcttctc 175320 

ttagtttcta gccttttaaa caagatcttt ctctgtagtc ccacggatac tcagcttctt 175380 

caaaagcctt tggtgagaag tcttccaaaa tgccttttgg caatcctaat tggctgtagt 175440 

ggtttgacct tctcatttgc aagcttcttt acaccttcaa aaacctccaa aagattcatg 175500 

atgtaggact tattgttaca aaatgctgtt ccaacacatc atgcttacct acgtgcccag 175560 

taggcctgct tatccatttg ccctgtacag actcatgagc ttgtaattct gtgtatcttt 175620 

gaaatctttt taagaaatgg cactgtatta atcatcctcc agtttcttgg aaggtgctgt 175680 

taaatgaggt tagcgttctt tcatcttgga gttaagtttc agtctcctga gtaatttctc 175740 

agtcattaaa ttcaatcagc cttactgaat gcttgttgct catgttctgc ataattttgt 175800 

gtaaccgtgg tttaagacta atcagggaat ttcctgccgg gaatctccat gaacagtttc 175860 

atagaagaca gtaatgtaaa agtcagttct agtttattct accataacat tcccttcata 175920 

tctgaccctc tactgacctc acaaactctt ggcagcttga ctctgaagca aatcttaaaa 175980 

cattgtttca actgtccttg aagctattct ttcttcttgt actgaactct gtcataaaac 176040 

ataaaaatgt acttcaagtc agtctgagaa atcaaaaata atttaaaaaa atgtgtagaa 176100 

tgtttatctc ataggatttc aaaattacaa atttgcatgt tggatttaaa acacaaaact 176160 

ttcaagcatc attttttgtg aaacacaaat actgaatttt tgatcagtcc ttgcttatta 176220 

tttaacacag aatcataatc acagaatcat agaatggcct gggttgaaaa ggaccacaat 176280 

gatcatcgag tttcaacccc tctgctatat gcagggtcaa caaccagcag accaggctgc 176340 

ccagagccac atccagcctg gcctggaatg cctccaggga tgggcatcca caacctcctt 176400 

gggcaacctg ttccagtgtg tcaccaccct ctgtgtgaaa aacttcctcc taatatctaa 176460 

cctaaacctc ccctgtctca gtttaaaaac cattccccct tgtcctatca ctgtccaccc 176520 

tcataaacag tcattccccc tccagtttat acactccttt caaatattgg aaggccacaa 176580 

tgaggtctcc ctggagcttt ctcttctcca agctaaacaa gtccagtccc ctcaaccctt 176640 

cttcatagga gaggtgctcc agccctctga tcatcttagc agccatcatc acagaatgta 176700 

tcagaatttg ttttacgggt tatctagttt caaatgaatt acattttctt ccagtatgta 176760 

ttagtatgta ttgcatgatc tgttgagatg atctttttct actatttttg tgcttaaatt 176820 

taactatata agcatacatt ttccaattct atccttcaga aatttttaca actgatcaca 176880 

aggtactacc aagcaaagcc acaagctgta aactttaaga cagatgcgga acaagccaga 176940 
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gcacagatca attcctgggt tgaaaacgaa actgagagta agtatcgctc tgatggcttt 177000 

ttcttttctc acttcaaaat catttgcatt tccacttgaa ttgctcttgc agtaagggat 177060 

ccataaagga tggaaactgt ggggaaatga tgaacaaatt gcagttaaat gtcttgaaga 177120 

aagccaacca ccaaaactaa ctgctgcccc ttgcaaagtt tttcccttga tttttcatgt 177180 

catagtctct tctgaagtat ttctgttcat aaggaagcag agtggatact acatggctcc 177240 

actctgatca gtgaaggttt tacttctgca agcttcaact ggttgcagcc aactccagag 177300 

aacttccacg ctttacacac ttcttaacat cttttactac taaaactgaa ataaatatgg 177360 

tttaaaaaac agtgatgctt caaaagccat ttatgtatgt acgctgtgaa aaatgcacag 177420 

ggaaaaaaaa tctctgagtg taaacacttt tgttagatag ctaggcatag agaaagcaca 177480 

tctgaaattg gtgagttgtg cattcgcagc gaattaacag tcctatctat ttgattttta 177540 

agggaagatc cagaatctgc tacctgcagg atctcttgat tctgacactg tattggtctt 177600 

agtaaatgct atttacttca aaggaaactg ggaaaagagg tttctggaaa aagacacatc 177660 

cgagatgccc ttcagattaa gcaaggtaaa ttccttcaaa atgtctatta tggcagagca 177720 

agaatcctct aaatatttca cctgcatttc acatcccagt acaacactac ttacagcagt 177780 

agcagatggt ataaactctg agaacagcaa cagtgaaaat aaatcagcag tctcatttat 177840 

acagatgcat gagattagga ttttcagtta agttagtagc ttcttggcac caaaacagtt 177900 

gaaaacacca tggttaagca gcttaaggac aagagaaagt ttctctaagt actgagatat 177960 

cattttcaga aggaattgag ctaattctga gagcagtact tcgacaccta ggtctctttt 178020 

catgcttttc agacagaggc tgtataatgt gagctcaagt agcctaagtg ttctttccta 178080 

atgccctggc cattgcgtaa aacctcacgt ggaattctca agagggtttg tcattttagc 178140 

cagatgcgta tggatgatgt gttcagcatg cattgtgggc acgactgagc ttacagtatc 178200 

tcagtgattg tgcatggaca atttacagta gctgacagca tgcatacttt cggcttgtgt 178260 

caaaggtgag caaaaagaat tttcattcag aacacgttgt ttgacatgag attacgagtg 178320 

caaacacctt ttgtatgtct ggtgatgtga agcaattgtg tcgatactgt ggctgtgtta 178380 

tctgaaacct actacattgc atgcgcagtt ttaggacctg taatagtaca cggtgcacag 178440 

aaagggtttc attcacagag tggctgatag caaagcctgc aaacagataa gcttttgcac 178500 

ttgtgtaaca atggaaaaga gagagtggat atatcagtga aggtctctga gcataataca 178560 

gcgtaagagt tcagatgatt actgtctaac gcgatttcag ttggtaatcc aaacctctac 178620 

agtttgggaa agagaaaaaa caagcagagg tcacagcaaa tatggtccat aggtaaattc 178680 

aatcaatcag tgctgtccgg aagcatacaa aagagttgat gacatccaga gaatgaaagt 178740 

cagcattttt ttcccctccc aatcaacaca ttcactcaag aaatgtaaag ttttggggaa 178800 

aacttgaaac atacaaagta gtttcttgtt taccaaagct aactctttca aaagagtgag 178860 

aaatacattg catgtaatta tgttatcagg tggtgtctgt gctttttttt tctttttttt 178920 

cttttcttct gaagatccct ttgactttga aacaggagaa atggcactgg gaaagaataa 178980 
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tgccaagtct tatacttgtt tacaattttt ttttgccttc agttcaacaa agcaagtaat 179040 

ctttaccatt caccttaagg aataagtaca actaatcttt ttcttttctg ttctttttta 179100 

acatctgaat catttcacag accaagacta aagcagtaca gatgatgttt ctcagagata 179160 

catttttgat gctccatgaa caaacaatga aattcaaaat tattgagctg ccgtacgcgg 179220 

aaaatgaact cagcatgttc gtactcctac cagatgacat cagtgataac actactggtc 179280 

tggagctggt aaaactgaca tactgcatca cacgcactac aaagcactaa cagaaataga 179340 

tgaaaacagt gaggaagaat gaacttcaaa tgacacaatg actgctcagc ctaggtttca 179400 

gggcatctat taatgatgca aaaatacaaa tctacctgag gatacaccta aaaaagtatg 179460 

cccactctac tctcttagcc tattcggtgc ctccttttct acctccagaa tagcagaata 179520 

acgaaagcaa gaatcaaatc taaaccactg tgccccagaa ttaatcttct gagggcaaca 179580 

ctaaccagtt ttatgtcatc cgcagtccag atttccacct gatactttgt aacgaggctt 179640 

ttcaaactcg gggctgactt accttgaccc atgaggtatc agcagccact catgaccgtg 179700 

ccaggattag ttcctgaatc taaatacatc agagcttcag aatctaaata catcagggca 179760 

aatatctttt tatttgctct tgaggtccca tgcattccac ttacttacca ctactaagag 179820 

aaatgcctta caaattcaca cataccaagc acttattaat gtggttaagt tggacactgc 179880 

ataaaagcaa cacttctcat atccacctcc aaaataatga attattctga aggttcactc 179940 

tacacctcac tgcatttaag gaaacagata gaagtacagg tcactcagca ctatgcagga 180000 

tcacatccta agaatatgca gcacatttca gctgtactca cagctggtag ttggaccttt 180060 

taaatctaga gcattagaca ccaatgtatg catgccttct tttttctgtt gcattatgac 180120 

tatattctta taaaattcat tgcaggtaga aagagagctg acccacgaaa aattagctga 180180 

atggtccaac tcagcccgta tgatgaaagt cgaagtggaa ctgtacctgc ccaagttgaa 180240 

gattgaagaa aattatgatc ttacatccac tttgagcaac atggggatac aaaatgcttt 180300 

tgaccctgtt caggctgatt tcacaaggat gtcagcaaag aaggacttct tcctatcaaa 180360 

agttattcac aaagcttttg tggaggtcaa tgaagaaggt accgaggcag cagctgccac 180420 

aggtgtcctg gtgttgaggt caagaacacc tagagtaact ttcaaagccg accacccttt 180480 

tctcttcttc atcagacaca acaaatccaa aaccatcctc ttctttggca gactatgctc 180540 

accttagtca gagtcactcc ctgctctaca gagcaggaga tgctggcttg ccagctcaag 180600 

ggcagagctt gatactcctg ctgcagctga gggactaaga cctgcactct ttcagactac 180660 

acattccaca gcccaaggca aagcttcaac tactccagat agccatagca gtgcctgtag 180720 

atgcatttga ttccttcctc ttgcagcagt agatacaaac acatggcact atcttcgttc 180780 

tcacaagtag agcacctgat tcaggtgtgc atcttcaccc ttccaccctg ccataattag 180840 

cccctgctcc tctgtagctc ttgactagtt ctttttgtta cagaggcaca cacagcccaa 180900 

gcttaagtct ttaccagttc acttccattc tactgattgc ctgaaagaca taacaagcac 180960 

acactcccac gtgggctatt tcctcgcacg gagttacagg tgtgacagaa gagcctgacc 181020 

Page 89 



WO 2004/094640 



PCT/US2003/039244 



Chicken ovalbumin Locus. ST25 
catgctgctg actttataca aagcagcacc tgcttcaaaa atagcagtac tgataataaa 181080 

caacccctcg tagcttgatg gtgctttctg tcagctctac caggagggga aggcagaagg 181140 

ggaaatcaag cagcgacaag aggctcgcgg aggtagcgac ctccgagcta aaatggccgc 181200 

ctcccactgc tgcagcgagt gctcagggcc gctttccgca gctgagctcc agccctctcc 181260 

cccacgatgg gcggcccgtg gctaggcaaa aacttccggg aggagggcgg ggcagaggcc 181320 

aggggaaagc tggtgctcgg ctgggtgagt gtggagggtc tgtgttgttg ttttctgcgg 181380 

gaaacacgca ttggtttttt gaggggagac ggtagcgttt ccctcgcggc ggcgctctga 181440 

gcggtttcgg cgggcgcggc cgccgggcgt tgaccgggtg ctggaggcgg gaggggcccc 181500 

gcagagttcc gcaccgctgg aatccatccc tgtcatccag ccctgcctct gtgggttttg 181560 

tggcaaacag gcggaaatcg atggagaggt gcgagcttca gcctgttctg agtcacaggg 181620 

agagagcttg gccaattgtc ctgcgcccag ccttattgga gctgtaaggt gcacgggatt 181680 

aaatcgctcc tgcttcaggc agaatggaag gactgtttca gtccaagttt tcttttcatc 181740 

agtgttttta tggctatggg cagaaggaaa catgagtaca gctgcagctg ttgaacgtag 181800 

ccaagctcct accaagaatt tgtcttagag gaaacatgcc tgaggaaact tgctgctacc 181860 

gcttgtttga gatgatgaat cattaataca aagtaggcgt tggctctgta ttttctagca 181920 

acgtaccaac accaggcact gccttagggg aaaaaaaaca aaccaccttt actactagtt 181980 

gatatcctgc gatgtctgct ggcacttatc tgtaacttac tccacgttct ggcactcgtt 182040 

gctccttcct gtaggtatgt agtataactt cggattagtt agctacctgc tcggctgacg 182100 

tatgtgaagt ctgacaagca ctgagctacg tatgtgccat gaagttccca ataaaccgtt 182160 

tactttattg cgtctgtttc catcgtgtag acaataaaag gcaaactgca gtggactttg 182220 

attttgtacc acagcaggaa accccagtaa tctgtaatgc tgaccagata aatttcgttt 182280 

gaatattgta gatcgagtca ttcagttgga ttctggcaga ctgactgcta ggtctagaac 182340 

acaagtgaag taatcttgaa gggaatactg aagacacaca gactttgaga aggtgagttt 182400 

ataattctgc cattctgata cctttctgct ttggttttcc tgtaaagcaa ataactgtct 182460 

ctgtggagcc aaaggagact tattctacca agtcctagta tgctcatctc aaaaaatata 182520 

gtattattta ctccatgaag aagaccaatg acttttcctc actacaagaa agacattgag 182580 

gtcttggagt gtttccagag aagggcaaga aagccgtgaa gggtctggag cacaagtctt 182640 

actattgagt ggctgagggg gctgggattg tacagcctga agaagaggag gctcagagga 182700 

gactttatca ctctgtacag tgacctgaaa ggaggttgta gttggcctct tttcctgggt 182760 

aacagcaata ggatgagagg ggatggtctc aagttgtgtc aaaggaggtt cagatgggat 182820 

attatgaaca atttattttc cgagtggtga ggcactggca cagactgccc aggaaggtgg 182880 

tggagtcacg tctctggagg catttaagaa acatgtaaat atggcactga gggatgtggg 182940 

ttagtggaca tggtgggaat gggttgacag ttgtactaga tgatcttata tttgctttat 183000 

ggtttatatt gagaaatgta aaagacagaa ataggttgtc agtttgtgat caaataaatt 183060 
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taagccaatc ttcatttttt tttttctcct aggctttgaa ccatggatag cctcagtgca 183120 

gcaaattcca cttttgctct tgacctttta aatgagctgc gtgagaaaag cagcacaaag 183180 

aatctattct tttctccttt tagtatttct tctgctttgt ctatgatttt actgggttca 183240 

aaaggggaca ctgaagccca gatagcaaag gtatgtatcc aaacgtaatg tattggattt 183300 

gatgcatata tcatctactt aatgatatat gaactacaga tctgagatct gtattacagt 183360 

ctgtgacctc taattgctga attgttacag tcattctggc ctcagaggtc agaagtcttc 183420 

cttaggtatg tacataagca gaacctattt ctattgagtt tatgtatagg acttactgca 183480 

gtgtgaaatt aagagattcc tgttttttgg ggtgtgtgtg ggtttttgtt tgtgatacgg 183540 

agatcttcct tttatatgtc attaacaggc acctggaatt tctttttttt tttacttaca 183600 

tatttgtata tttagagcta tagatgaatc tccagttaca taaaataatt tactctgtaa 183660 

tctttttggg cttaatatca gactttgcat acttcaaaaa tgtagccaga taatcaaggg 183720 

aaaaaaaatc caacatacaa gcatgtcatg ttaaacagtc ccagatttta ggaaacaaac 183780 

aaaaaaatga tcagttgctt gttcagtgta atagcttttg ttttcacaac ctgtaatctc 183840 

aatcctggaa catccagaag aaagaagtga tacagggcta agaacatagc tctgaagttc 183900 

cagagaatac cccagcaaag attcaatggg gcaaagctgc gtggccagtg aagagtaaaa 183960 

ttcataatgt aaacttgcaa ttaaattacc aggagagcag ttaaggagtg cagtggtggg 184020 

cctgttgtgt gacagtaggg tcaaatctat cattaactgc agtgcagttt attctacgtt 184080 

cactaaggtg cgtgcctgcc tctctctttc tggtattgta atttggagta gatcatcaat 184140 

actttttcat ttgtagctat ggtagtagtg atgaggctga atgaggatga agctgatgtg 184200 

ttgttttaat gggaatttaa atatttgctt gtgttgacat cggctccagc agcctatttc 184260 

ctgttatcgc ttgaaggatc gggtttgcat ctaaggtatt aaataagatg ctttggtgct 184320 

attataatca gtgtgaaaaa ttatggaaag ttgttttttt ttatttaatc ttcaggctcc 184380 

tttgtttctg gattttaaca gttttgctag gttttatagg gtggagatta taaatcctca 184440 

gttctctaag aagtactgtg tacagcatta agaaaagggc agaatgtgtc tgcactcaga 184500 

cttctttgga ggctggatgg gttccttaga aagcagggag ataaaccagg taacctccat 184560 

agcttccttc caacctcaac cattgtgtga tcctctaatg cttggacaaa atgaagataa 184620 

ataccactca cttttcagca acgtaatttc ttgcttatac aacatctgtg tggatacatt 184680 

gtacgtgact tgtgtaatga aaaatctgct ggcttcaagt ctcaaaactc atttaaaaac 184740 

agaacaattg tgctgatgca agtgtgtcag agattacgtg gactccacag aaggtatttg 184800 

tctctctgca ggtgctttct ttgaacaaag ctgaggatgc tcacaatggg tatcagtcgc 184860 

ttctctctga aattaacaac cctgacacca aatacatcct cagaactgct aaccgacttt 184920 

atggagaaaa gacatttgag tttctctcag taagtaaaca ttaaatttgg gtgttgtgaa 184980 

gtataatgta cttgctagct attccccttg aaggttagat aaaggctttg ggttttactc 185040 

tccaaatttt tctaggctga gacttacaac ctgagagtct atgcaaaaag caggatgtga 185100 
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acagaatgga gaagctactt ttagattata tgaatgcaca actggtgcaa gaccatgaaa 185160 

aaaaactaaa tcttctaggt ttcttggtcc acttttggtg ggttctagga tcaaatgaat 185220 

gacaaatctc cttgcctttg ataacctgta gctatgatga aaacaactgt tactgctgtc 185280 

cagcatgggc agaacttttc ttttttctta attaaacaat ccagagaaca tgctgagagg 185340 

agtatgtgac tcttaatatt ttccttataa gtatatatac acaagagggc acaggtacgt 185400 

tgcatataca ttacatatac attataacat tgtatgttct ctcactcaag caaaaagaac 185460 

aaacggaaga aacaaaaaga aacaacccag acaatcattt ctcagttgag tactgtagaa 185520 

tgttctggtg tattaaagaa gacatttgac ttcttaataa caaagaggaa gataattcct 185580 

agctcagatg gctaataaaa caactgataa gaacatgtca gacaaaacct gaatggcttt 185640 

atatcaagct gggggaagag aggatataga tttttctcag tgtacttaaa aacatctgtg 185700 

gctgaatgtc agtaaaatgc attgctaaaa agctgtttta aatgttcatg gcagtcattt 185760 

atagaatcga gtcagaaatt ctaccatgct gggctagaac agactgactt caaaaatgct 185820 

tcagaggatt ccagaaagca aataaatggc tgggtggaag agaagactga aggtgagtgt 185880 

tctgcagaac tccctgctgt atgtaatgtc agccaggact tgcataaaca gctctgtcaa 185940 

ggtgtaatac tgtcattttt aaagcaaaca caaacctcag ccattgtgct ctgtctctgg 186000 

ttggggcata attcccatat ctgatctatc gttaatacat attagatact ctgtattgca 186060 

acagttgctt acgtaccact gttcaatttg tgttttctaa aggtaaaatt caaaaattgt 186120 

tggcagaggg aattattaac tcaatgacca aacttgtgtt ggtgaatgcc atctacttca 186180 

aaggcaactg ggaagagaag tttgacaaag agcgcacaaa agaaatgcca tttaaaatta 186240 

acaaggtacg ctacgttaat atgctgacaa tacaaaggtc tttgtaatac agaagacaaa 186300 

aattgttcaa gcagatttac ctaaggtagt ctgcatggag ctccctatgc cctgtcccct 186360 

tagtatgaac actctctttg tttagtttct gttaagtttc acataattac taaaaacttt 186420 

aatatcacat atttatttta tactctctct tttttttcct ttactctttt tgtttgtgtt 186480 

tcagttggtg aacttgacta tgtcagtgta aaatctgcat gggcaaaaaa cattcatagg 186540 

ttccaggcag aaaagaactt ccgtgtgtgc agaaatgtct gaatatagca gtcatcttca 186600 

gtcagaatgc ttttctttct gctgtgtttc taccactaaa ttgatagaaa tgaaatgagg 186660 

tgaagaaaaa aaaaaccact ctcctttgaa ggcctccatg cttgactttc ttttgcttct 186720 

aaaagtgcag cagggcaatc gaggaggact ttatgtacta tcattaatag gctacggctg 186780 

ccccttagag gtcaatttca aactctggat gtccacccag gtgtcgtgag agtgaactgc 186840 

taatgtgaat tgcttaagaa ctcacctgct taaaataacc acaatgcaaa attgaagctc 186900 

tagtgcctaa tttcaaactt cagtgttgaa atatatacag gaatgcttga aactgctaat 186960 

accacttttc aaacagggaa taataatatt gctcttgcca tactgtatgc tatagcactt 187020 

agaaaccact gcactgactt ggttcctgtt aggaagggag gttttttatc agtttcccac 187080 

agagatgtca cacaaaaccc aagcttacat tctgcttaga gttttttcct ctccctcctc 187140 
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aggaggcaaa tccagtgctg tttctctggg tacgaggctc agcctagttc tgagattacc 187200 

ctttcctttg cagacacaca tttatttttg aagactgcag tttttgggat gcagatggct 187260 

attggaacaa gttgtaagat gtgagactgg ggaatgctgc cttggctcat caagtaacac 187320 

gctgttagat gtgcaaccac aaacctcttc ccttacaaaa ctaagtggct taaatttcta 187380 

ttttcatcct attgatgact agtcactgat gagctacaga agtcaatgag taggctcaaa 187440 

taagcaatga aaaatccaaa gggcaaagct gaagttttaa gctagttatt ttacagtctg 187500 

tccaggagta gttacttaaa cataccagta gtcttctgag cattctgtga catctatttt 187560 

atctgtgact tttgcacttt gttgtgactc actggatgat tctagcatgc agtgtgggct 187620 

tttctttgct tatccatcat tttcatgtgt cactgattgc tgttgcaaaa tcatttccga 187680 

catattctgt tctcagagtt catggcagtc atttatagaa ttgagtcaga aattcactgt 187740 

ctcaatggtc tttcctttaa aaagaaaaaa cggtgaaggt aaggggaaga agggatttag 187800 

actccacaga aaaggaggaa aataatgtag acaaaagtaa ctgatgctcc atgcaaaaat 187860 

ggagagagat gggggagaaa ctggtagtaa gaagacaaaa gattaacctc tccatgtgcc 187920 

ttttaacaat tcaaagtgat gctaatactt tcctagcatt tttagtggct tagtaaaata 187980 

tttttgttgc cctacgtcag agtaattaga gacacatgga gtgaaatgaa aatatcgaag 188040 

ttgaagttat tttgtattta tttaaagcaa ggaatacagg ctctgcttat taccaacttt 188100 

gtttagagct tgtcactact tctaaagtga gcaaatatgt attcttgctc ctttacccta 188160 

aagcaaattt cacagatatc tccaattaac aattaaatct cagggatcct tacttctcat 188220 

ctcttgcttt acgaaagagt gactgtgcta tactatgtta tgcagtgtac ttagttctct 188280 

gtgcagtcaa atagtaaaaa gccctaagta actagatgcc tgcttcatgt attaggactg 188340 

tcatgccagc ccagtagtaa ctcttagtgt ctctttcatt ttagaatgaa accaaacctg 188400 

tgcagatgat gttcagaaaa ggtaaataca acatgaccta tattggagac ttggagacca 188460 

aaatccttga gatcccttac attggtaatg aactcagtat gatcgttcta ctccctgatg 188520 

caatccagga tgaatctact ggcttggaaa aggtaagtta ttgagctcag tgcaaagaca 188580 

gtttgtgtcc tgccttggaa gagagtttgg tgctgcacat ggattcacag ttcagtttca 188640 

gagctattat atcattgatg ctcaagactg actgaaatgc tccttgtgtt tctgccccta 188700 

aagtggcatg ccatctatta ctactggcca agctatgtgc tgctgtgcta agaggctctg 188760 

aaagaggcct catcagaagc tgtagttatg gtgaagccat agtatgatga gcaccaaatg 188820 

agagggaatt tggggcagct cttaggaagt ccttaccaga atttctacag tttgtcccat 188880 

aggtcatctt agtgaagacc tggcagattg tcactgcccc tctacttgga aacacgctca 188940 

cagaatagtc caggttccct tccgttgtga tgatagaata caagtcatgc tctggcctct 189000 

tgtttttttt tctaatgctg attttaattt aaaaagtgtt gtaagcaggt tttgtcacca 189060 

gcccgtgagc tgaaagatcc tgaaaggctg aagaactggg ttcagtttgt ttggggcctt 189120 

gtcagcagtt ctcccgtgcc tttactccct atatataaaa taaggttttt acaatctgat 189180 
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aatgttttat aaactgaact ttactgtatc taccacgaaa aagaaaacac caaacaagaa 189240 

ttgacctcag ctgaagctgt agtctctagt aagtagaaac ctgtagtgac ttgtgctttt 189300 

gacttgggat cctgtaagct cctgaaaaag atgcatattg catgtatgtg tttacataac 189360 

acacatacac agacaaaagt agagattagt gcaaaactgt cactattctt attttaatta 189420 

cctaatgttg ggttatgttt cgttgctttt tttgttttaa gctggaaaga gaacttacat 189480 

acgagaagct gatggattgg atcaatcctg aaatgatgga cagtacagaa gtgaggctgt 189540 

ctttacccag atttaaactg gaagaaaatt atgatctgaa acccatcctg agcaacatgg 189600 

gaatgcgtga tgcgtttgac ttacggatgg cgaacttctc aggaatctcc tctggtaacg 189660 

agcttgtgct ctctgaagtg gttcacaagt ccttcgtgga ggtcaacgaa gaaggcactg 189720 

aagcagcagc tgccacagca ggagtgatgg tgctccgttg tgctatgatc gttcccgact 189780 

tcactgccga tcatcccttc ctcttcttca tccggcacaa caaaacttcc agtattttgt 189840 

tctgtggcag atattgctct ccctaagaag agagacagaa gagctaccat taacgcagta 189900 

atgtgatttc ttttaggata gaactgctct tttgcactaa ctgcttattt ccactgtgcc 189960 

tgaatcccct tatctggttg tcattttggg cttgcgtaga gtaacaaagc cacttacaca 190020 

tacacagcag ctaccacttg aaacagctgc cttacacttt gcacctaagt ggagttgttt 190080 

tcttgctggc ccaagaaaga tgaacatccc acttgctcag tgaacttcca cctgtcttat 190140 

attttctatt gcactttgct tttgtgtggc caccaggtag caaggtgaca aagagaaaag 190200 

aagtggattt tgtttctgac tatagtggaa gatatcttat gctctgctcc ccatttttct 190260 

tcctctcccc acttattttt aactttttct ttaatgtttt gataatagag ggagatgaaa 190320 

ggaggctttg gcgacctatt tgtaagagtt actaagcatc tgcactagac agaggtttta 190380 

ttataactgg atagcactta cacaaggatg ggaataaaag tatgtctgta acaaatgacc 190440 

ttagaggttt tcatggagta cggattctta tcttaacacc acatgtgcca cctgggaata 190500 

ttagctatca ctcacctact tcattagtct tttaaaaaaa gaatgttttt aaaaacaaac 190560 

aaacaaaaaa aaacccatag atgcctatgt agtatttaag tgacagagct ttatttttgt 190620 

ttttcagtct ttatatgttt ttttcctatt ctgggtttgt aaagcatctt tgttaatctg 190680 

aatgccaaag gtttcttaac gcagtgattt acgtgttttg ctgttcttga aagaataaac 190740 

aaatttgttg tgagtgctgt gggcattgcc cataaatttt gtggggtttt tttttttcat 190800 

ggctactgta aagaaaacaa gcaatcaact ttcgtgtagc ttatgcagaa ttcattgctt 190860 

aacagaggct tttctgaatg ctgcaagacc aagatgctta cctggattac gatggagttt 190920 

aggtttttac cttcgaagga ttcatagcaa ggagtctttg aggcaaaggc tcaagggatt 190980 

ttaaagacta tctggttcca actccctgct gtgggcatgg ctgcccaggg ctccctcctc 191040 

cctggccttc gacacctcca aggatggggc acccacagct tccctgggca gctgtgccag 191100 

tgcctcacca acctccaagt gaaaaagatc cccctgacat ctaattgaaa gttcctttca 191160 

tttattttaa agccattccc tcttattcca tcactatcag accaaggctc cctttatgta 191220 
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ttggaaggcc acaacaaggt ccccttggag tcttctcgag gctgaacaag ctaagtgctt 191280 

gtttccagat ggggtactgc tgctgtgggt gcaactcctt ggccccaggc caaggagtgt 191340 

gccatgcctc agatgcagcg attagtcacc atttggggtg aggaagctgc caaagtgctg 191400 

cctgtcagac cgatgctcag tcagggctga gagcagcagt gggtagaagg gaagtgggca 191460 

gcctctgctc ccagtgcatt gtctgggaag ggggtggtag caagatgaaa agtagaaatt 191520 

tttctgaccc ttcctacgtg tccaggctgc tgctggagtg tattcatggt gctatgctta 191580 

aagtgaaagc aaaagcgtgc ttgtctaatt tgcttctttt ctaaattgaa aaggaaagta 191640 

atcacattaa cgtctaccat aaagcagaga gaagctgcca gaaagcttga gagaagctag 191700 

aagcagccat atctacaaat cccagtgcaa acaagaagga gggatcccag ctgcacaagc 191760 

aggaaggcag gaaggtttta cagcactgtc tgccgccagc ctttgcgtaa ccatctgccc 191820 

gccccagcat tgcacctttc aacccactcc cagagacctc acagctccca gtggtcctag 191880 

ctccagctta ctgctggctg ctctcctcct ggtttgatcc tccctagcag ctgccaagca 191940 

tcacaggagg taagtgtgtg cttgctgtgc ctctgcattt tgcagcctga aatgaatcca 192000 

gcccttggaa ctcgcactag ggcatcgagg agtgctttct gaagccttca ctgaaacttt 192060 

tatttttcag ctgcagccat ggagagcctg agcaatgcca acagcaggtt tgcacttgat 192120 

ctcttccgaa aggttaatga gacgaaccca tcaggaaaca ttttcttctc ccctctcagt 192180 

atttctactg ctctggccat ggtcctcctc gggtccagag gtaatacaga gacacaggtg 192240 

ctgaaggtca gcagcatttt cgcttgtttt attaaaatta aatgttgttc agttttagag 192300 

acaaggcaag gggaggaggg cgttatttgc gtgagcttgg ggcaaggttc ctgtcactcc 192360 

tgctgactct tcccccctgc tgccacctgc ctgctgcact ccagagccct cctcttgtgc 192420 

tcactgatag cccttctttc tcacttcatt tgggttaatt gatgaatctg gaaactaatt 192480 

tcactgattt atcagtctta atttaaaatc gattagcatc tccagcagca agtctttact 192540 

agagcttgtg ataggacatg ggggaacagc attaaacaaa aagggaagat ttaggcaaga 192600 

tatttgttag aggaagtttt tccactgaga aagtggtgag gtggtggcac agctgcccag 192660 

ggaagctgag ggtgccccat ccctggaggt gttcaagacc aggttggatg gggccctggg 192720 

cagcctgagc tggtgggtgg ctgccctgcc cacagcaagg cagtggaact gggtgggctt 192780 

taagttgaat tccagcccaa ccccattcta tgattctatg agccttttcc acagagaact 192840 

attgttttgc aatgtataca tacataatgg tatacatagt aatgctaagt gtatcttata 192900 

aataaaaaat aaaatataga gctgtattat tctaaggctg acaactgtta caatacatgg 192960 

tgatgttacc caagacccag tgttatagca gccaagcacc cagtatttct gaggagcaga 193020 

actcacgtgt ccattctcat ggtatcctta ggttgagcag cagaggttaa atgaaaatgg 193080 

tgtggctcct ttactggggg ctttgttgtg gacccagctc atcaatcctt tcccactctc 193140 

cacaacagca gttgcaaact gcaaattctc atgtaggtag cagtgccaat tccctctcag 193200 

actcatgttc aaaagggacc ctgcctcttt tttaatttgc aaggcaaaca cctgctagtg 193260 
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caaggggaag tatgaaagaa ttgtcgctgt agttcctatt aacttatttg ccctatgatt 193320 

aagttcacct ttgtattccg aactttagga agaacttgtt tagaccatta actgctgcca 193380 

ttctttgtga aaagactata aaactgaatc actgcttgta gaaacagact ttgaacatac 193440 

attccttata actcaactgt cagccccacc caggaagaat ctactgagag cagaaataat 193 500 

gcaagagaag catagggaag ttggagatag aaggttggga tgaatggttg gactgggtga 193 560 

tcctgtgtgt cttttccaac ttcagtgatt ctatgattct aaggtgtttc agcacagtaa 193620 

ccttctgtaa tgcacattcc catggtataa tgtttaattg atgagaacat cagttaatta 193680 

aggagatgat gactgatgag tgtgaagggt gtttataagc atgcagaaat ccatttctgg 193740 

gatcataatc ctaccttaag ttggaatcat agagtacacc acggtggagg ggatccatga 193800 

ggatccagct ccacacagca ccacccacta tggtagatcc tgctgcccaa cctgcacacc 193860 

ttggctagtc agcttccctt caggtatctg tatgcacgct ttcatattat aacagctttt 193920 

aattttaagg tgatagttgt ctgtagaagc acttatattt tcataaaacc aaaggttata 193980 

gctctcacat tttcctaaca cctcaccttc cctgagtgct cagacaagct cagtagtcca 194040 

cggaggaaaa acatgcagac agcaccctat taggactctg gatcacaatt aacagcttca 194100 

gctgtggcta actgtattca gctactgctt tacaagtgac atggctggca cagcactaag 194160 

ggacagtttc acttgttctt tgatggttac agctttcagc ttctttctgc ttttgttttt 194220 

caacttaact accaaacaaa taccatacag atatgctgca tgttctctat aaatacagca 194280 

ttagcagtag ttagctcatc tctttcattt cagacgtttc attttgatga agttgaaaat 194340 

atacactcaa gattccgggc tctgactgca gacatcaaca gaagggattc ttcctgtctc 194400 

ctacggattg ccaaccggct ttatggagag aagtcctaca gctttctgcc ggtatgggta 194460 

cacagaccat agctgtgtgg tggaacctgg ggggaggctt tgtaacttca tcatctgttg 194520 

ctctcctgcc tccagaacgc gccccatagc aaaaatatca caccagcaag tccagatgtc 194580 

aaaactatct ttctgcatca ataagcagca tagctcaggt gttgctgtct ttataggaat 194640 

gcagccattt gagtatttga ggtaaaaaca tgactagaca tctaaaagtt accaggcagt 194700 

cagtacgagt gttgtacaca tgcctataga tgcagaaatg catatgcatc tggacatcct 194760 

aaaggatacg cctagaggat attacataac aaatcccttt ctttgatagt tcagttctgc 194820 

tgctttgggg ctcaagagaa attgcaagcc atgtaggttc ttagcttaga gtacagatta 194880 

gcaatgcccc attcctctgt ctgttgtttt ttaggctttt cattgctcta gtactatatt 194940 

acttaaaaca tttttgaaaa catttctctg gggggagatt gccatcatgt ctcaacagca 195000 

tgcctcttta caagggaact gtacctctgc atctatttag gtactgctat ttttatccct 195060 

ctccagctct ttctgggagt ttttgttttc ttagtcaagc tt 195102 

<210> 2 
<211> 285 
<212> DNA 
<213> SV40 
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<400> 2 

aaagtctaga gtcggggcgg ccggccgctt cgagcagaca tgataagata cattgatgag 60 

tttggacaaa ccacaactag aatgcagtga aaaaaatgct ttatttgtga aatttgtgat 120 

gctattgctt tatttgtaac cattataagc tgcaataaac aagttaacaa caacaattgc 180 

attcatttta tgtttcaggt tcagggggag gtgtgggagg ttttttaaag caagtaaaac 240 

ctctacaaat gtggtaaaat cgataaggat ccgtcgagcg gccgc 285 

<210> 3 

<211> 498 

<212> DNA 

<213> Artificial 

<220> 

<223> Human interferon alpha2b codon optimized for avian expression 

<220> 

<221> mi sc_f eatu re 

<222> (1) . . (498) 

<400> 3 



tgcgatctgc 


ctcagaccca 


cagcctgggc 


agcaggagga 


ccctgatgct 


gctggctcag 


60 


atgaggagaa 


tcagcctgtt 


tagctgcctg 


aaggataggc 


acgattttgg 


ctttcctcaa 


120 


gaggagtttg 


gcaaccagtt 


tcagaaggct 


gagaccatcc 


ctgtgctgca 


cgagatgatc 


180 


cagcagatct 


ttaacctgtt 


tagcaccaag 


gatagcagcg 


ctgcttggga 


tgagaccctg 


240 


ctggataagt 


tttacaccga 


gctgtaccag 


cagctgaacg 


atctggaggc 


ttgcgtgatc 


300 


cagggcgtgg 


gcgtgaccga 


gacccctctg 


atgaaggagg 


atagcatcct 


ggctgtgagg 


360 


aagtactttc 


agaggatcac 


cctgtacctg 


aaggagaaga 


agtacagccc 


ctgcgcttgg 


420 


gaagtcgtga 


gggctgagat 


catgaggagc 


tttagcctga 


gcaccaacct 


gcaagagagc 


480 


ttgaggtcta 


aggagtaa 










498 
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